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POLYMORPHIC OLFACTORY RECEPTOR GENES AND ARRAYS, KITS AND 
METHODS UTILIZING INFORMATION DERIVED THEREFROM FOR 
GENETIC TYPING OF INDIVIDUALS 

5 FIELD AND BACKGROUND OF THE INVENTION 

The present invention relates to polymorphic olfactory receptor genes and 
more particularly, to arrays, kits and methods utilizing information derived from these 
polymorphic sequences for genetic typing of individuals. 

Olfactory transduction begins with the binding of an odorant ligand to a protein 

10 receptor on the olfactory neuron cell surface, thus initiating a cascade of reactions 
which results in the production of a second messenger and eventual depolarization of 
the cell membrane. This relatively straightforward and common signalling pathway is 
complicated by the fact that there are several thousand odorants, mostly low molecular 
weight organic molecules, and nearly one thousand different receptors. 

15 ORs are members of the superfamily of membrane receptors characterized 

structurally by possessing seven transmembrane spanning helices, and functionally by 
being coupled to GTP-binding proteins. Although OR genes make up the largest sub- 
family of G-protein coupled receptors (GPCRs) most vertebrate odorant receptors are 
classified as "orphan" receptors having no identified ligand. 

20 It is now clear that the members of this gene family have common ancestral 

origins which have undergone considerable divergence throughout evolution. Strong 
selective pressures have caused expansion and diversification of the OR gene 
repertoire, thereby modifying and honing the sense of smell in mammals. 

The diversified OR repertoire of mammals is needed in order to allow 

25 individuals to detect and discriminate between thousands of different odorant 
molecules. One of the most surprising features of human olfaction is that >60% of the 
OR genes bear one or more sequence disruptions, likely resulting in the inactivation of 
the encoded protein. There are indications that such massive OR pseudogenization is a 
relatively recent genomic process, likely to still be ongoing. Specific loss of OR 

30 genes resulted in an ensemble of only -400 functional ORs in humnas, likely leading 
to a decay in some aspects of olfactory faculties. 

It is well known that human olfactory thresholds exhibits a very high degree of 
inter-individual variability. At one extreme is specific anosmia, an individual's 
incapacity to perceive an odorant detectable by many others. Less pronounced cases 
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are termed specific hyposmia, which are characterized by an increased threshold 
(diminished sensitivity) to a particular odorant in certain members of the population. 
At the other end of the spectrum is specific hyperosmia, enahnced sensitivity to a 
specific odorant. 

5 These phenomena can also be described in terms of quantitative traits, whereby 

the human population displays a threshold distribution in which specific anosmia and 
hyperosmia form the two extreme ends of the distribution. Because many dozens of 
odorants manifest such variability, it turns out that practically every human being is 
specifically anosmic or hyperosmic to one or more odorants. 

10 Olfactory receptors interact with a diverse array of volatile molecules. It is 

widely accepted that every odorous molecule binds to several ORs and vise versa. This 
binding pattern generates a unique combinatorial code that generates a specific aroma 
for each odorant and enables the organism to distinguish it from other molecules. This 
system is highly sensitive and allows to discriminate between two protein isomers and 

15 at times even between two optical enantiomers. 

It was suggested that odorant binding patterns correspond to a particular 
receptor affinity-binding distribution (RAD) [Lancet et al. ( 1993) PNAS (30) 3715- 
3719]. According to this model, the probability that an odorant will bind to its 
receptor(s) can be described through a distribution of binding affinities. Most of these 

20 binding affinities are weak, and only few have biological significance of which, the 
strongest affinity receptor determines the odorant threshold sensitivity. Thus, if such a 
receptor is missing from a receptor repertoire of an individual, the threshold would be 
defined by the next strongest affinity, to an existing receptor. In the case of a large 
receptor e nsemble, a t ypical s cenario w ould b e that an o dorant w ould b e b ound b y 

25 several ORs with closely spaced affinity values. Consequently, it could lead to such a 
high level of functional redundancy, that a loss of an odorant highest-affinity receptor 
is unlikely to generate a recognizable olfactory deficit. This appears to be the case in 
macrosmatic organisrps, (e.g. the mouse) where relatively few cases of clearly 
discernible threshold variations have been reported. 

30 In sharp contrast, in humans, where the OR repertoire has diminished 

significantly in size, affinity values would tend to be more widely spaced, and 
threshold variations would become prevalent. In these cases, an excessive decrease of 
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specific odorant detection is expected in individuals who have lost the receptor 
exhibiting the strongest binding to a particular odorant. 

Such a hypothesis suggests that specific sensitivity to odorants present at below 
threshold concentrations may be treated as a single gene trait. In addition to such 
quantitative effects, it is worthwhile to consider the effect of a single gene inactivation 
on our qualitative olfactory cues. If a unique aroma of a given odorant is a product of 
signals from various ORs, a single change in this OR spectrum could modify the way 
the odorant signal is interpreted in the brain. 

While reducing the present invention to p ractice, the present inventors have 
uncovered polymorphic OR genes which exhibit loss or reduced function in receptor 
capacity. The present inventors have also demonstrated that the occurrence of these 
allelic variations differ in individuals from different ethnic backgrounds thereby 
suggesting that polymorphism in OR genes contributes to differences in smell 
perception of individuals. 

SUMMARY OF THE INVENTION 

According to one aspect of the present invention there is provided an 
oligonucleotide comprising a nucleic acid sequence selected suitable for identifying an 
olfactory receptor gene or an allelic variant thereof, the olfactory receptor gene being 
selected from the group consisting of SEQ ID NOs: 79-104. 

According to further features in preferred embodiments of the invention 
described below, the oligonucleotide further comprising a detectable moiety attached 
to the nucleic acid sequence. 

According to still further features in the described preferred embodiments the 
detectable m oiety i s s elected from the group consisting o f a dye, a fluorophore, an 
enzyme, a ligand and a radioisotope. 

According to still further features in the described preferred embodiments the 
oligonucleotide is selected from the group consisting of SEQ ID NOs: 1-78. 

According to still further features in the described preferred embodiments the 
oligonucleotide is an SNP-specific oligonucleotide. 

According to still further features in the described preferred embodiments the 
oligonucleotide is a primer extension oligonucleotide. 
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According to another aspect of the present invention there is provided a kit for 
identifying an olfactory receptor gene and/or an allelic variant thereof, the kit 
comprising at least oligonucleotide having a nucleic acid sequence selected suitable 
for identifying the olfactory receptor gene and/or the allelic variant thereof. 
5 According to still further features in the described preferred embodiments the 

kit bfurther comprising reagents suitable for detecting identification of the olfactory 
receptor gene and/or the allelic variant thereof by the at least one oligonucleotide. 

According to still further features in the described preferred embodiments the 
kit further comprising packaging material identifying the at least oligonucleotide as 
10 being utilizable in detecting the olfactory receptor gene and/or the allelic variant 
thereof. 

According to still further features in the described preferred embodiments the 
at least one oligonucleotide is selected from the group consisting of SEQ ID NOs: 1- 
78. 

15 According to still ftirther features in the described preferred embodiments the 

at least oligonucleotide includes a detectable moiety attached to the nucleic acid 
sequence. 

According to still further features in the described preferred embodiments the 
detectable m oiety i s s elected from the group consisting o f a d ye, a f luorophore, an 
20 enzyme, a ligand and a radioisotope. 

According to yet another aspect of the present invention there is provided an 
array for detecting the presence or absence of at least one allelic variant of an olfactory 
receptor gene in a subject, the array comprising at least one oligonucleotide being 
contained in or attached to a support, the at least oligonucleotide having a nucleic acid 
25 sequence selected suitable for specifically identifying the at least one allelic variant of 
the olfactory receptor gene. 

According to still further features in the described preferred embodiments the 
array further comprising at least one additional oligonucleotide having a nucleic acid 
sequence selected suitable for specifically identifying the olfactory receptor gene. 
30 According to still further features in the described preferred embodiments the 

olfactory receptor gene is selected from the group consisting of SEQ ID NOs: 79-104. 
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According to still further features in the described preferred embodiments the 
at least one oligonucleotide is selected from the group consisting of SEQ ID NOs: 27- 
78. 

According to still another aspect of the present invention there is provided an 
5 array for typing a subject according to presence or absence of allelic variants of 
olfactory receptor genes, the array comprising a plurality of oligonucleotides each 
being attached to a support, the plurality of oligonucleotides include at least one typing 
oligonucleotide having a sequence selected suitable for specifically identifying 
presence or absence of a specific allelic variant of a specific olfactory receptor gene in 
10 the subject. 

According to still further features in the described preferred embodiments the 
plurality of oligonucleotides also include at least one reference oligonucleotide having 
a sequence selected suitable for specifically identifying the specific olfactory receptor 
gene. 

15 According to still further features in the described preferred embodiments the 

support is a chip. 

According to still further features in the described preferred embodiments the 
at least one typing oligonucleotide is selected from the group consisting of SEQ ID 
NOs: 27-78. 

20 According to still further features in the described preferred embodiments 

wherein the at least one reference oligonucleotide is selected from the group consisting 
ofSEQIDNOs: 1-26. 

According to an additional aspect of the present invention there is provided a 
method of typing a subject according to presence or absence of allelic variants of 
25 olfactory receptor genes, the method comprising detecting the presence or absence of 
at least one allelic variant of an olfactory receptor gene in a biological sample of the 
subject thereby typing the subject. 

According to still further features in the described preferred embodiments 
further comprising detecting the presence or absence of the olfactory receptor gene in 
30 the biological sample of the subject. 

According to still further features in the described preferred embodiments the 
olfactory receptor gene is selected from the group consisting of SEQ ID NOs: 79-104. 
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According to still further features in the described preferred embodiments the 
detecting the presence or absence of at least one allelic variant of the olfactory 
receptor gene is effected using at least one oligonucleotide selected from the group 
consisting of SEQ ID NOs: 27-78. 
5 According to still further features in the described preferred embodiments the 

detecting the presence or absence of the olfactory receptor gene is effected using at 
least one oligonucleotide selected from the group consisting of SEQ ID NOs: 1-26. 

According to still further features in the described preferred embodiments the 
detecting the presence or absence of at least one allelic variant of the olfactory 
10 receptor gene is effected by detecting DNA and/or mRNA sequences. 

According to still further features in the described preferred embodiments the 
detecting the presence or absence of at least one allelic variant is effected using at least 
one oligonucleotide selected from the group consisting of SEQ ID NOs: 27-78. 

According to still further features in the described preferred embodiments the 
1 5 detecting the presence or absence of the olfactory receptor gene is effected using at 
least one oligonucleotide selected from the group consisting of SEQ ID NOs: 1-26. 

According to yet an additional aspect of the present invention there is provided 
a nucleic acid construct comprising a polynucleotide encoding an olfactory receptor 
gene and/or an allelic variant thereof, the olfactory receptor gene being selected from 
20 the group consisting of SEQ ID Nos: 79-1 04 and a promoter for directing transcription 
of the olfactory receptor gene or the allelic variant thereof in a cell. 

According to still an additional aspect of the present invention there is 
provided a cell comprising the nucleic acid construct. 

According to still further features in the described preferred embodiments the 
25 cell is a mammalian cell. 

The present invention successfully addresses the shortcomings of the presently 
known configurations by providing polymorphic olfactory receptor genes and arrays, 
kits and methods utilizing information derived from these polymorphic sequences for 
genetic typing of individuals. 
30 Unless otherwise defined, all technical and scientific terms used herein h ave 

the same meaning as commonly understood by one of ordinary skill in the art to which 
this invention belongs. Although methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, suitable 
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methods and materials are described below. In case of conflict, the patent 
specification, including definitions, will control. In addition, the materials, methods, 
and examples are illustrative only and not intended to be limiting. 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is herein described, by way of example only, with reference to 
the accompanying drawings. With specific reference now to the drawings in detail, it 
is stressed that the particulars shown are by way of example and for purposes of 
illustrative discussion of the preferred embodiments of the present invention only, and 
10 are presented in the cause of providing what is believed to be the most useful and 
readily understood description of the principles and conceptual aspects of the 
invention. In this regard, no attempt is made to show structural details of the invention 
in more detail than is necessary for a fundamental understanding of the invention, the 
description taken with the drawings making apparent to those skilled in the art how the 
15 several forms of the invention may be embodied in practice. 
In the drawings: 

FIG. 1 illustrates the olfactory receptor (OR) gene cluster on human 
chromosome 17pl3.3. Genes are positioned in their correct order and orientation, with 
exons marked as triangles and introns marked as squares. The regions that were 

20 sequenced are filled. The serial numbers of individual SNPs analyzed within each OR 
gene are shown inside ovals and are corresponding to the list detailed elsewhere 
(http://bioinfo.weizmami.ac.il/-menashe/ORl 7 SNPs.htmlV The span of the three 
intervals used for Clark's algorithm is indicated with Roman numerals I-IIL 

FIG. 2 illustrates the hapiotypes of 30 diploid individuals, as marked in the left 

25 column. The 40 polymorphic sites are indicated by their position in the different genes 
on the top row. The rare variant is represented by a dark square. The intact and 
disrupted alleles of the putatively functional SNPs are marked by green and red 
squares, respectively. The segregating pseudogene hapiotypes are shown in order of 
their increasing disruption level from top to bottom. The normalized fraction of each 

30 haplotype in the Pygmies was as follows: I, 0.1; II. 0.0; III. 0.6; IV. 1.0; V. 0.9; VI. 
0.9; VH. 0.9. 

FIG. 3 illustrates a linkage disequilibrium diagram across the OR cluster. SNP 
numbers are as in Figures 1 and 2. Dark blue squares indicate pairs in significant LD 
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indicated by both the Clark and EM algorithms. Other colors indicate pairs in 
significant LD indicated only by one algorithm: Clark's (light blue) or EM (yellow). 
The significance level in all pairs labeled in color was p < 0.05 using Fisher's exact 
test. T he p osition o f the SNPs relative to the OR genes and the Clark's algorithm 
5 analysis intervals are indicated near the diagonal. 

FIGs. 4a-e are scatter-plots of the decay of LD with physical distance. Each 
point is the D -value for a pair of sites separated by a physical distance indicated on 
the abscissa. A linear-fit trend-line for the data is plotted. The significance of the data 
was c alculated u sing a permutation test. The d ecay o f LD for t he e ntire s ample i s 

10 shown at a distance of 124 Kb (R = -0.53, P < 0.01). Pairs with significant (P < 0.05) 
D -pairs are in red triangles (Figure 4a). 

FIGs. 4b-e illustrate the decay of LD for the specific ethnogeographic groups: 
The Pygmies (Figure 4b), the Yemenite Jews (Figure 4c), the Bedouins (Figure 4d) 
and the Ashkenazi Jews (Figure 4e). 

15 FIGs. 5a-b illustrate the observed individual OR genotypes in African- 

Americans (Figure 5a) and non-Africans (Figure 5b). Red indicates homozygous OR 
disruption, dark green denotes homozygous intact OR and light green is for 
heterozygotes. Both individuals (rows) and OR loci (columns) are ordered according 
to their disruption level. Disruption type is indicated on top: N - nonsense mutation, 

20 D - Deletion/Insertion and S - mis-sense substitution at a highly conserved site. The 
178 putative phenotypes seen in the 189 individuals based on recessive inheritance are 
in accordance with a computed expectation value assuming Hardy-Weinberg 
equilibrium and based on the allele frequencies present. 

FIG. 6 illustrates population differences in the frequency of intact OR alleles. 

25 The values for Caucasians are plotted as dark bars, and those for the Pygmies as empty 
bars. OR genes are indicated by their official Human Genome Organization human 
gene nomenclature symbols (Glusman et al. 2000). 

FIG. 7 illustrates the phenotype distribution of the OR gene cluster for different 
ethnic groups. Shown are the population frequencies of the counts of deduced 

30 functional loci in African Americans (light bars) and non-Africans (dark bars). OR 
loci were considered intact if the individual carried at least one copy of the intact 
allele. The broken line curves are Gaussian fits with \l = 20.06 and a = 1.73 for 
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African Americans and jx = 19.40 and a = 1.61 for non- Africans. The statistical 
significance of the difference in the number of intact ORs between African- American 
and non- African individuals was assessed by a Man-Whitney-U test, P = 0.0022, or by 
a £ test, P — 0.014. A comparable P-value was obtained also when the functional 
5 allele was not assumed as dominant and the three different genotypes were not used at 
each locus (Man-Whitney-U test, P = 0.0001, or j? % P = 0.015). 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is of polymorphic olfactory genes and arrays, kits and 

10 methods utilizing information derived therefrom for typing the olfactory sensitivity of 
individuals or populations. 

The principles and operation of the present invention may be better understood 
with reference to the drawings and accompanying descriptions. 

Before explaining at least one embodiment of the invention in detail, it is to be 

15 understood that the invention is not limited in its application to the details set forth in 
the following description or exemplified by the Examples. The invention is capable of 
other embodiments or of being practiced or carried out in various ways. Also, it is to 
be understood that the phraseology and terminology employed herein is for the 
purpose of description and should not be regarded as limiting. 

20 Evidence collected over the past decade suggests a clear genetic basis for 

human olfactory threshold variations. Characterization of human OR genes over the 
past decade has paved the way for creating a direct link between olfactory phenotypes 
and OR genotypes. However, to date, such linkage has not been established. 

While reducing the present invention to practice, the present inventors have 

25 uncovered a set of novel OR genes (SEQ ID NOs: 79-104) in which a single 
nucleotide polymorphism (SNP, i.e., variation from the most frequently occurring base 
at a particular nucleic acid position) segregates between intact and disrupted alleles in 
the human population (see Example 2 of the Examples section). 

These alternative forms of the genes, hereinafter also referred to as allelic 

30 variants, exhibit remarkable diversity among human population (see the Examples 
section hereinbelow) by which every human is characterized with a unique set of intact 
and disrupted ORs. More intriguingly, significant differences in intact ORs count 
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were observed between different populations suggesting that the inter-individual 
olfactory differences might be more notable among groups of diverse ethnicities. 

As further described in the Examples section which follows, the present 
inventors employed two screening approaches to find human polymorphic OR genes. 
5 In the first approach, OR pseudogenes that have only one open reading frame 
disruption were s ought [ Glusman (2001) Genome Res. 1 1:685-702]. F ifty of these 
sequences were sequenced in a chimpanzee since differential pseudogene state 
between the two higher apes would suggest recent evolutionary events which might 
generate human polymorphism. The second approach included querying Celera's 
10 human SNP database for genetic variations, which might affect protein integrity. 51 
OR loci obtained from these screenings where genotyped in 189 individuals from 
several ethnic origins (see Example 1). 

As illustrated in the Example section which follows, 26 OR loci were found to 
segragate between an intact and disrupted alleles in the study group, giving rise to an 
1 5 unprecedented number of unique genotypic patterns i.e., haplotypes. 

Extrapolation has estimated the number of such segregating pseudogenes 
(SPGs) in the entire human genome at approximately 60 genes, which cover 
approximately 15% of the human functional OR repertoire. This number of SPGs in 
rough agreement with the reported count of different modes of human odorant-specific 
20 sensory deficits. 

Thus, the presently discovered genotypic disparity is an important tool in the 
elucidation and possibly characterization of olfactory variation in humans. 

Thus, according to one aspect of the present invention there is provided a 
method of typing a subject according to presence or absence of allelic variants of the 
25 olfactory receptor genes of the present invention. 

The method i s e ffected by detecting the presence o r absence o f at 1 east one 
allelic variant of an olfactory receptor gene in a biological sample of the subject, 
thereby typing the subject. 

As used herein the term "subject" refers to a mammalian subject which is 
30 preferably a human. 

As used herein, the phrase "biological sample" refers to a sample of tissue or 
fluid isolated from a individual, including but not limited to, for example, plasma, 
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serum, spinal fluid, lymph fluid, the external sections of the skin, respiratory, 
intestinal, and genitourinary tracts, tears, saliva, milk, semen and organs. 

Preferably, the biological sample of this aspect of the present invention is 
obtained from the olfactory neuroephithlium, located at the upper area of each nasal 
5 chamber adjacent to the cribriform plate, superior nasal septum, and superior-lateral 
nasal wall. Methods of obtaining olfactory epithelium are well known in the art. 

Detecting the presence or absence of at least one allelic variant of an olfactory 
receptor gene according to this aspect of the present invention can be effected at at the 
nucleic acid sequence level of the OR gene (i.e., DNA or transcribed RNA) or at the 

10 protein level, i.e. polynucleotides expressed from the OR gene. 

Numerous hybridization b ased t echniques are known in the art for detecting 
nucleic acid sequence variations. A given nucleic acid sequnece or any number of 
sequences can be detected by hybridization to a specific probe. Such probes may be 
cloned DNAs or fragments thereof, RNA, typically made by in- vitro transcription, or 

15 oligonucleotides; oligonucleotides can also be used as primers in amplification based 
detection approaches (i.e., PCR). 

As used herein, the term "oligonucleotide" refers to a single stranded or double 
stranded oligomer or polymer of ribonucleic acid (RNA) or deoxyribonucleic acid 
(DNA) or mimetics thereof. This term includes oligonucleotides composed of 

20 naturally-occurring bases, sugars and covalent internucleoside linkages (e.g., 
backbone) as well as oligonucleotides having non-naturally-occurring portions which 
function similarly. Such modified or substituted oligonucleotides are often preferred 
over native forms because of desirable properties such as, enhanced affinity for nucleic 
acid target and increased stability in the presence of nucleases. 

25 The oligonucleotides of the present invention preferably include nucleic acid 

sequences that are substantially homologous to nucleic acid sequences that flank 
and/or extend across the SNPs of the present invention (see Table 5 of the Examples 
section). 

Examples of oligonucleotides which can be used to identify both ORs of the 
30 present invention and allelic variants thereof are presented in Table 6 of the Examples 
section which follows (SEQ ID NOs. 1-78). 

Oligonucleotides generated by the teachings of the present invention may be 
used in any modification of nucleic acid hybridization based techniques. As such the 
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oligonucleotides of the present invention can c orrespond to any cDNA, mRNA and 
genomic sequences regions which stretch across 10 bp, 20 bp, 30 base pairs (bp), or 
even 40, 50, or 100 bp, or longer. Oligonucleotides of 10 to 1000 bp or even more, 
may have utility as hybridization probes in a variety of hybridization techniques 

5 including Southern and Northern blotting. The total size of oligonucleotide used, as 
well as the size of complementary sequences depend on the intended use or type of 
detection employed. 

In general, the oligonucleotides of the present invention may be generated by 
any oligonucleotide synthesis method known in the art such as enzymatic synthesis or 

10 solid phase synthesis. Equipment and reagents for executing solid-phase synthesis are 
commercially available from, for example, Applied Biosystems. Any other means for 
such synthesis may also be employed; the actual synthesis of the oligonucleotides is 
well within the capabilities of one skilled in the art and as such is not further described 
herein. 

15 The oligonucleotides of the present invention may comprise heterocylic 

nucleosides consisting of purines and the pyrimidines bases, bonded in a 3 f to 5' 

phosphodiester linkage. 

Preferably oligonucleotides utilized by the present invention are those modified 

in either backbone, internucleoside linkages or bases, as is broadly described 
20 hereinunder. Such modifications can oftentimes facilitate oligonucleotide uptake and 

resistivity to intracellular conditions. 

Examples of oligonucleotides which can be used with this aspect of the present 

invention include oligonucleotides containing modified backbones or non-natural 

internucleoside linkages. Oligonucleotides having modified backbones include those 
25 that retain a phosphorus atom in the backbone. U.S. Pat. NOs: ,687,808; 4,469,863; 

4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 

5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466, 677; 

5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 

5,587,361; and 5,625,050 disclose oligonucleotide synthesis approaches which can be 
30 utilized by the present invention. 

Preferred modified oligonucleotide backbones include, for example, 

phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, 

aminoalkyl phosphotriesters, methyl and other alkyl phosphonates including 3- 
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alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates 
including 3 -amino phosphoramidate and aminoalkylphosphoramidates, 
thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and 
boranophosphates having normal 3 f -5* linkages, 2 , -5 l linked analogs of these, and those 
5 having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3 f -5' 
to or 2 , -5* to 5'-2\ Various salts, mixed salts and free acid forms can also be used. 

Alternatively, modified oligonucleotide backbones include backbones that are 
formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom 
and alkyl or cycloalkyl internucleoside linkages, or one or more short chain 

10 heteroatomic or heterocyclic internucleoside linkages. These include those having 
morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane 
backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl 
backbones; methylene formacetyl and thioformacetyl backbones; alkene containing 
backbones; sulfamate backbones; methyleneimino and methylenehydrazino 

15 backbones; sulfonate and sulfonamide backbones; amide backbones; and others having 
mixed N, O, S and CH 2 component parts, as disclosed in U.S. Pat. Nos. 5,034,506; 
5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 
5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 
5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623, 

20 070; 5,663,312; 5,633,360; 5,677,437; and 5,677,439. 

Other oligonucleotides which can be used according to the present invention, 
are those modified in both sugar and the internucleoside linkage, i.e., the backbone, of 
the nucleotide units are replaced with novel groups. The base units are maintained for 
complementation with the appropriate polynucleotide target. An example for such an 

25 oligonucleotide mimetic, includes peptide nucleic acid (PNA). A PNA 
oligonucleotide refers to an oligonucleotide where the sugar-backbone is replaced with 
an amide containing backbone, in particular an aminoethylglycine backbone. The 
bases are retained and are bound directly or indirectly to aza nitrogen atoms of the 
amide portion of the backbone. United States patents that teach the preparation of 

30 PNA compounds include, but are not limited to, U.S. Pat. Nos. 5,539,082; 5,714,331; 
and 5,71 9,262, e ach o f which is h erein i ncorporated b y r eference. O ther b ackbone 
modifications, which can be used in the present invention are disclosed in U.S. Pat. 
No: 6,303,374. 
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Oligonucleotides of the present invention may also include base modifications 
or substitutions. As used herein, "unmodified" or "natural" bases include the purine 
bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) 
and uracil (U). Modified bases include but are not limited to other synthetic and 

5 natural bases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, 
hypoxanthine, 2 -aminoadenine, 6 -methyl and o ther a lkyl d erivatives o f adenine and 
guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2- 
thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and 
cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8- 

10 halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and 
guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted 
uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8- 
azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3- 
deazaadenine. Further bases include those disclosed in U.S. Pat. No: 3,687,808, those 

15 disclosed in The Concise Encyclopedia Of Polymer Science And Engineering, pages 
858-859, Kroschwitz, J. L, ed. John Wiley & Sons, 1990, those disclosed by Englisch 
et al., Angewandte Chemie, International Edition, 1991, 30, 613, and those disclosed 
by Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, 
Crooke, S. T. and Lebleu, B. , ed., CRC Press, 1993. Such bases are particularly 

20 useful for increasing the binding affinity of the oligomeric compounds of the 
invention. T hese include 5 -substituted p yrimidines, 6 -azapyrimidines and N -2, N -6 
and 0-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5- 
propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic 
acid duplex stability by 0.6-1.2°C. [Sanghvi YS et al. (1993) Antisense Research and 

25 Applications, CRC Press, Boca Raton 276-278] and are presently preferred base 
substitutions, even more particularly when combined with 2-O-methoxyethyl sugar 
modifications. 

It will be appreciated that it is not necessary for all positions in a given 
oligonucleotide molecule to be uniformly modified, and in fact more than one of the 
30 aforementioned modifications may be incorporated in a single compound or even at a 
single nucleoside within an oligonucleotide. 

The oligonucleotides of the present invention are contacted with the biological 
sample to generate oligonucleotide-nucleic acid sequence specific hybrids. Contacting 
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the oligonucleotides of the present invention with the biological sample is effected by 
stringent, moderate or mild hybridization (as used in any polynucleotide hybridization 
assay such as northern blot, dot blot, RNase protection assay, RT-PCR and the like). 
Wherein stringent hybridization is effected by a hybridization solution of 6 x SSC and 
5 1 % SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 
0.5 % SDS, 100 ng/ml denatured salmon sperm DNA and 0.1 % nonfat dried milk, 
hybridization temperature of 1 - 1.5 °C below the T m , final wash solution of 3 M 
TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 0.5 % SDS at 1 - 
1.5 °C below the T m ; moderate hybridization is effected by a hybridization solution of 
10 6 x SSC and 0.1 % SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM 
EDTA (pH 7.6), 0.5 % SDS, 1 00 ng/ml denatured salmon sperm DNA and 0.1 % 
nonfat dried milk, hybridization temperature of 2 - 2.5 °C below the T m , final wash 
solution of 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 
0.5 % SDS at 1 - 1.5 °C below the T m , final wash solution of 6 x SSC, and final wash 

15 at 22 °C; whereas mild hybridization is effected by a hybridization solution of 6 x SSC 
and 1 % SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 
7.6), 0.5 % SDS, 100 [ig/m\ denatured salmon sperm DNA and 0.1 % nonfat dried 
milk, hybridization temperature of 37 °C, final wash solution of 6 x SSC and final 
wash at 22 °C 

20 In general, quantifying hybridization complexes is well known in the art and 

may be achieved by any one of several approaches. These approaches are generally 
based on the detection of a label or marker, such as any radioactive, fluorescent, 
biological or enzymatic tags or labels of standard use in the art. A label can be applied 
on either the oligonucleotide probes or nucleic acids derived from the biological 

25 sample. 

The following illustrates a number of labeling methods suitable for use in the 
present invention. For example, oligonucleotides of the present invention can be 
labeled subsequent to synthesis, by incorporating biotinylated dNTPs or rNTP, or 
some similar means (e.g., photo-cross-linking a psoralen derivative of biotin to 
30 RNAs), followed by addition of labeled streptavidin (e.g., phycoerythrin-conjugated 
streptavidin) or the equivalent. Alternatively, when fluorescently-labeled 
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oligonucleotide probes are used, fluorescein, lissamine, phycoerythrin, rhodamine 
(Perkin Elmer Cetus), Cy2, Cy3, Cy3.5, Cy5, Cy5.5, Cy7, FluorX (Amersham) and 
others [ e.g., Kricka et al. (1992), Academic Press San Diego, Calif] can be attached to 
the oligonucleotides. It will be appreciated that pairs of fluorophores are chosen when 
5 distinction between two emission spectra of two oligonucleotides is desired or 
optionally, a label other than a fluorescent label is used. For example, a radioactive 
label, or a pair of radioactive labels with distinct emission spectra, can be used [Zhao 
et al. (1995) Gene 156:207]. However, because of scattering of radioactive particles, 
and the consequent requirement for widely spaced binding sites, the use of 
10 fluorophores rather than radioisotopes is more preferred. 

The intensity of signal produced in any of the detection methods described 
hereinabove may be analyzed manually or using hardware and software suited for such 
purposes. 

Alternatively, detection of allelic variants according to this aspect of the 

15 present invention can be effected at the protein level provided that the allelic variation 
is expressed in the amino acid sequence of the OR. For example, OR1E3P (SEQ ID 
NO: 84) includes a single base deletion in nucleotide coordinate 54 which causes a 
frame shift resulting in a premature stop codon. Such variation can be detected at the 
protein level based on, for example, electrophoretic mobilization, N-terminal Edman 

20 sequencing or antibody recognition. OR3A1 (SEQ ID NO: 89) is another example 
wherein a single nucleotide substitution (G > A) results in an amino acid substitution 
(Arginine > Glutmaine) in a highly conserved region of the ORs [i.e., the DRY motif 
Reich (1998) Proc. Natl. Acad. Sci. USA 95:8119-23; Risch (1996) Science 273:1516- 
7; Rouquier(2000) Proc. Natl. Acad. Sci. USA 97:2870-4]. In this case too, such 

25 sequence variation can be detected using a specific antibody. 

Polypeptide sequences can be extracted from the biological sample using a 
variety of methods which are well known to the ordinary skilled in the art. The protein 
can be isolated by conventional means of protein biochemistry and purification to 
obtain a substantially pure product, i.e., 80, 95 or 99% free of cell component 

30 contaminants, as described in Jacoby, Methods in Enzymology Volume 104, 
Academic Press, New York (1984); Scopes, Protein Purification, Principles and 
Practice, 2 nd Edition, Springer- Verlag, New York (1987); and Deutscher (ed), Guide to 
Protein Purification, Methods in Enzymology, Vol. 182 (1990). 
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As used herein the term "antibody", refers to an intact antibody molecule and 
the phrase "antibody fragment" refers to a functional fragment thereof, such as Fab, 
F(ab , )2, and Fv that are capable of binding to macrophages. These functional antibody 
fragments are defined as follows: (i) Fab, the fragment which contains a monovalent 
5 antigen-binding fragment of an antibody molecule, can be produced by digestion of 
whole antibody with the enzyme papain to yield an intact light chain and a portion of 
one heavy chain; (ii) Fab', the fragment of an antibody molecule that can be obtained 
by treating whole antibody with pepsin, followed by reduction, to yield an intact light 
chain and a portion of the heavy chain; two Fab' fragments are obtained per antibody 
10 molecule; (iii) (Fab% the fragment of the antibody that can be obtained by treating 
whole antibody with the enzyme pepsin without subsequent reduction; F(ab')2 is a 
dimer of two Fab' fragments held together by two disulfide bonds; (iv) Fv, defined as a 
genetically engineered fragment containing the variable region of the light chain and 
the variable region of the heavy chain expressed as two chains; (v) Single chain 
15 antibody ("SCA"), a genetically engineered molecule containing the variable region of 
the light chain and the variable region of the heavy chain, linked by a suitable 
polypeptide linker as a genetically fused single chain molecule; and (vi) Peptides 
coding for a single complementarity-determining region (CDR).. 

Methods of generating antibodies (i.e., monoclonal and polyclonal) are well 
20 known in the art. Antibodies may be generated via any one of several methods known 
in the art, which methods can employ induction of in vivo production of antibody 
molecules, screening immunoglobulin libraries or panels of highly specific binding 
reagents as disclosed [Orlandi D.R. et al. (1989) Proc. Natl. Acad. Sci. 86:3833-3837, 
Winter G. et al. (1991) Nature 349:293-299] or generation o f monoclonal antibody 
25 molecules by continuous cell lines in culture. These include but are not limited to, the 
hybridoma technique, the human B-cell hybridoma technique, and the Epstein-Bar- 
Virus (EBV)-hybridoma technique [Kohler G., et al. (1975) Nature 256:495-497, 
Kozbor D., et al. (1985) J. Immunol. Methods 81:31-42, Cote RJ. et al. (1983) Proc. 
Natl. Acad. Sci. 80:2026-2030, Cole S.P. et al. (1984) Mol. Cell. Biol. 62:109-120]. 
30 According to preferred embodiments of this aspect of the present invention, 

subject typing is effected using a plurality of oligonucleotides or antibodes which are 
attached to a solid substrate configured as a microarray. Microarrays are known in the 
art and consist of a surface to which probes that correspond in sequence to gene 
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products (e.g., cDNAs, mRNAs, cRNAs, polypeptides, and fragments thereof), can be 
specifically hybridized or bound at a known position (i.e., regiospecificity). 

Several methods for attaching the oligonucleotides to a microarray are known 
in the art including but not limited to glass-printing, described generally by Schena et 
5 al., 1995, Science 270:467-47, photolithographic techniques [Fodor et al. (1991) 
Science 251:767-773], inkjet printing, masking and the like. 

Antibody arrays are also known in the art and disclosed in U.S. Pat. No. 
6,329,209. 

Since the allelic variants described herein may exhibit modified odorant 
10 specificity, it may be advantageous to type the odorant ligand of such ORs. Thus, the 
present invention also envisages nucleic acid constructs which include the 
polymorphic OR sequences of the present invention and may be used to express these 
sequences in a variety of host cells. 

Cloning of the OR sequences of the present invention into nucleic acid 
15 expression constructs can be effected using commercially available eukaryotic 
expression vectors or derivatives thereof. Examples of suitable vectors include, but 
are not limited to pcDNA3, pcDNA3.1 (+/-), pGL3, PzeoSV2 (+/-), pSecTag2, 
pDisplay, pEF/myc/cyto, pCMV/myc/cyto, pDR3.1, pSinRep5, DH26S, DHBB, 
pNMTl, pNMT41, pNMT81, which are available from Invitrogen, pMbac, pPbac, 
20 pBK-RSV and pBK-CMV which are available from Stratagene, pTRES which is 
available from Clontech. 

Any promoter and/or regulatory sequences included in the expression vectors 
described above can be utilized to direct the transcription of the OR genes of the 
present invention. 

25 Preferably, the promoter that is selected according to the host cells or tissues of 

interest. Examples of cell type-specific and/or tissue-specific promoters include 
promoters such as albumin that is liver specific [Pinkert et al., (1987) Genes Dev. 
1:268-277], lymphoid specific promoters [Calame et al., (1988) Adv. Immunol. 
43:235-275]; in particular promoters of T-cell receptors [Winoto et al., (1989) EMBO 

30 J. 8:729-733] and immunoglobulins; [Banerji et al. (1983) Cell 33729-740], neuron- 
specific promoters such as the neurofilament promoter [Byrne et al. (1989) Proc. Natl. 
Acad. Sci. USA 86:5473-5477], pancreas-specific promoters [Edlunch et al. (1985) 
Science 230:912-916] or mammary gland-specific promoters such as the milk whey 
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promoter (U.S. Pat. No. 4,873,316 and European Application Publication No. 
264,166). Promoters for expression of the polynucleotide can also be 
developmentally-regulated promoters such as the murine homeobox promoters [Kessel 
et al. (1990) Science 249:374-379) or the fetoprotein promoter [Campes et al. (1989) 
5 Genes Dev. 3:537-546]. 

The nucleic acid construct can be introduced into the cell via any 
transformation method known in the art. 

Once expressed, OR sequences of the present invention may be used to identify 
their congnate ligands. Methods of identifying OR ligands are disclosed in U.S. Pat. 
10 No. 5,993,778. 

Since the above described probes and array enable typing of individuals 
according to the OR alleles carried or expressed thereby and since as is clearly shown 
in the Examples section which follows the prevalence of OR alleles varies from one 
population to another, the OR typing methodology described hereinabove can be 
15 utilized to determine an olfactory sensitivity and range of populations, subpopulations 
and even individuals. 

Cross-cultural and sexual differences in olfactroy functions have been 
demonstrated in the past. A number of researchers evaluated the olfactory sensitivity 
and preferences of natives from primitive tribes in the late nineteenth and early 
20 twentieth centuries. 

Lombroso and Carrara [(1896/1897) Atti. Soc. Romana Antropol. 4:103] 
presented a group o f Dinkas of the S udan with dilutions of clove oil ranging from 
1:200 - 1:50,000. These authors reported that recognition did not occur for 
concentrations lower than 1 :2000 and that three Dinkas were unable to recognize the 
25 material even at the highest concentrations. 

Myers [(1903) Brit. J. Psychol. 1:117] used aqueous solutions of camphor to 
evaluate the olfactory sensitivity of a group of Murray Islanders. In addition, these 
individuals were asked to describe the odors of perfumes and other scents and to 
elucidate whether they liked or disliked them. The average olfactory acuity of the 
30 islanders was reported as being slightly higher than that of Scottish control subjects 
and their likes and dislikes were noted as being quite similar. 

Using a Zwaardemaker olfactometer, Grijns [(1906) Arch. Physiol. 509:517] 
compared sensitivity of a small group of Javanese subjects to those of some 
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Europeans. He concluded that the Javanese were about twice as sensitive to the three 
test materials i.e., acetic acid, ammonia and phenol. 

Contemporary studies devoid of inherent limitations of the above-described 
studies [i.e., the use of deadly poisons (i.e., phenol) and trigeminal stimulative 
5 materials (i.e., ammonia and acetic acid as test compounds], suggest that cross-cultural 
differences in odor preferences are goverened by still unknown genetically determined 
factors which interact with the well-known influences of experience on small 
perception to produce a variety of chemosensory experiences [Schleidt (1981) J. 
Chem. Ecol. 7:19; Davies and Pangborn (1985) Odor pleasantness judgments 
10 compared among samples from 20 nations using microfragrances. Seventh Annual 
Meeting of the association for Chemoreception Sciences, Sarasota, Florida, April 24- 
28]. 

As is clearly shown in the Examples section which follows, the present 
inventors have demonstrated for the first time a possible genetic basis for olfactory 

15 sensitivity by defining distinct segregating genetic populations based on the OR 
sequence data of the present invention. 

The g eneration o f s uch d istinct g enetic p opulations p rovides e vidence that a 
strong relationship exists between sequence variability in ORs and odorant-specific 
olfactory threshold variability. 

20 Thus, the present finding serve as the basis for correlating between the 

genotype and phenotype of olfactory perception. For example, if an allelic variant 
(pseudogene) is a low frequency allele in a specific individual, it would likely 
correlate to a specific anosmia, since it is a low frequency functional disruption. 
Contrarily, a high frequency OR allele present in a specific population or 

25 subpopulation could indicate a specific hyperosmia, a high odorant sensitivity present 
in that particular population or subpopulation. Utilizing these and other guiding 
principles, the present OR typing approach can be utilized to elucidate the linkage 
between an individuals genotype and olfactory perception. 

Elucidation of such a genotypic-phenotypic relationship can be effected by 

30 testing human volunteers from diverse ethnic groups which show clear genotypic 
variations in at least one, preferably several or more preferably all of the OR SPGs 
(allelic variants) described herein for threshold sensitivity to several odorants. The 
threshold sensitivities towards each odorant are expected to form a distribution in 
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which its two ends will be determined as M hyposmic ff (low sensitivity) and 
"hyperosmic" (high sensitivity). In addition, the functionality of the different OR SPGs 
will be determined in each individual by SNP genotyping in a high-throughput 
manner. Statistical analysis will then be used to identify significant correlation 

5 between specific odorant sensitivity and a particular OR allele, which might also 
indicate a specific interaction between the two molecules. 

The capability of such approach to identify specific receptor-ligand interactions 
was recently demonstrated in t he taste sense where differences in sensitivity t o the 
bitter taste of Phenylthiocarbamide (PTC) were associated to a SNP in the taste 

10 receptor gene TAS2R38 [Kim U.K et al, Science (299) 1221-1225 (2003)]. Similar 
associations between specific smell sensitivities and OR genes would shed light on the 
tremendous human olfactory variability and might open new commercial opportunities 
in the flavor and fragrance (F&F) industry. 

A number of standard measurements are known in the art for testing olfactory 

15 function and evaluating threshold of odor detection. When co-applied and correlated, 
these methods provide a reliable measure of olfactory ability. 

The Connecticut tests employ butanol threshold and odor identification. The 
University of Pennsylvania Smell Identification Test (UPSIT) is an odor identification 
test. Another test, the olfactory evoked response, is used in research centers along with 

20 odor identification tests to evaluate aberrant olfaction with relation to neurologic 
disease. 

Butanol threshold test - The butanol threshold test used at the Connecticut 
Chemosensory Clinical Research Center involves a forced-choice test using an 
aqueous concentration of butyl alcohol in one sniff bottle and water in the other. The 

25 subject is asked to identify the bottle containing the odorant, with each nostril tested 
separately. After each incorrect response, the concentration of butanol is increased by 
a factor of 3 until the patient either achieves 5 correct responses or fails to correctly 
identify the bottle with 4% butanol. The detection threshold is recorded as the 
concentration at which the patient correctly identifies the butanol on 5 consecutive 

30 trials. The scoring relates the patient's threshold to a normal subject population. 

Connecticut odor identification test - The odor identification test used at the 
Connecticut Chemosensory Clinical Research Center involves 10 items separately 
presented to each nostril in opaque jars. The items include 7 odorants, including baby 
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powder, chocolate, cinnamon, coffee, mothballs, peanut butter, and soap. The test also 
includes 3 trigeminal stimulants. The subject is given a list of 20 items with the 10 
stimuli and 10 other names as distractors and is asked to choose the name of the 
stimulus from this list. If the patients choice is incorrect, a second chance is given to 
5 correctly identify the item. The function score is derived from the number of odorants 
correctly identified, and it relates the patient's performance to a normal control 
group's performance. The performances on the butanol threshold and the odor 
identification tests are averaged to determine a composite function score. 

University of Pennsylvania Smell Identification Test (UPSIT)- The UPSIT 

10 involves ^microencapsulated odors in a scratch-and-sniff format, with 4 response 
alternatives accompanying each odor. The subject takes the test alone, with 
instructions to guess if not able to identify the item. Anosmic patients tend to score at 
or near chance (10/40 correct). The scores are compared against sex- and age-related 
norms, and the results are analyzed. This test has excellent test-retest reliability. 

15 The 3 above tests, given together and correlated, can provide a reliable 

measure of olfactory ability. Two more tests can be used; however, they are less 
reliable. 

Cross-cultural Smell Identification Test - This variant of UPSIT, which can 
be given in 5 minutes, was proposed for a quick measure of olfactory function. The 

20 12-item Cross-Cultural Smell Identification Test (CC-SIT) was developed using input 
on the familiarity of odors in several countries, including China, Colombia, France, 
Germany, Italy, Japan, Russia, and Sweden. The odorants chosen include banana, 
chocolate, cinnamon, gasoline, lemon, onion, paint thinner, pineapple, rose, soap, 
smoke, and turpentine. These odorants were identified most consistently by 

25 representatives from each country. This test is an excellent alternative for measuring 
olfactory function when time there is a time limitation, since it is rapid and reliable. 
The disadvantage of this test is that its brevity limits its sensitivity in detecting subtle 
changes in olfactory function. 

Olfactory evoked response - Olfactory evoked potentials are measured by 

30 electroencephalogram (EEG) electrodes and an electrooculogram to standardize the 
patient reaction to eye movements. A visual tracking task is performed to ensure 
constant alertness to the task, and headphones playing white noise are worn to mask 
auditory clues. Either carbon dioxide (no odor but a trigeminal stimulant) or hydrogen 
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10 



sulfide is delivered via an olfactometer to the nose in a constantly flowing air stream. 
Nl is the first negative peak measured, and P2 is the second positive trough. Latencies 
are measured to these 2 values. It will be appreciated, though, that in patients with 
neurologic disease, the UPSIT revealed abnormality more frequently than olfactory 
evoked responses. 

It will be appreciated that olfactory function can be tested using other 
methodologies. Tested subjects may be exposed to various odorants, including 
odorants from all 9 smell groups which include aromatic, fragrant, alliaceous (garlic), 
ambrosial (musky), hircinous (goaty), repulsive, nauseous, ethereal (fruity) and 
empyreumatic (roasted coffee). Such odorants are listed in Table 1, below. 

It will be further appreciated that since age is known to significantly interefere 
with the smell sense, age differences between subjects should be taken into 
consideration and more preferably subjects of similar age are tested. 



15 



Table 1 



Chemical Name 


Common Name 


Acetic acid 




5-Androst-l 6-en-3-one 




Benzaldehyde 


benzaldehyde 


2,3-Butanedione 


Diacetyl 


Butanoic acid ethyl ester 


Ethyl butyrate 


2-Butanone 


Ethyl methyl ketone 


1,8-Cineole 


Eucalyptol 


1 ,4-Diaminobutane 


Putrescine 


3 ,7-Dimethyl-2,6-octadienal 


Geranial, Citral A 


(E)-3,7-Dimethyl-2,6-octadien-l-ol 


Geraniol 


Ethyl-2-methylbutanoate 




|4-Ethylphenol 


p-Ethylphenol 


[Hexanoic acid 


Caproic acid 


(Hydrogen cyanide 


Hydrogen cyanide 


jHydrogen sulfide 


Hydrogen sulfide 


|4-Hydroxy-3-methoxybenzaldehyde 


Vanillin 


|3-(4-hydroxyphenyl)-2-butanone 


3-(para-Hydroxyphenyl)-2- 
butanone 


(isoamyl acetate 


Isoamyl acetate 


|lsoamyl proprionate 




|2-Methoxy-4-(2-propenyl)pheol 


Eugenol 
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2methoxy5methylpyrazine 




3 -Methyl- 1 -butanol 


Isoamyl alcohol 


3-Methyl-lH-indole 


Skatole 


5-Methyl-2-(l-methylethyl)cyclohexanone 


1-Menthone 


5-Methyl-2-(l-methylethyl)phenol 


Thymol 


2-Methyl-2-hexenoic acid 


2-Methyl-2-hexenoic acid 


2-Methyl-2-propanethiol 


tert-Butyl mercaptan 


2-Methyl-5-(l -methyl ethenyl)-2-cyclohexene-l - 
one 


i-v»/arvone i 


2-Methyl-5-(l -methylethenyl)-2-cyclohexene- 1 - 
one 


d-Carvone 


3-Methylbutanoic acid 


Isovaleric acid 


2-Methylpropanal 


Isobutryaldehyde 


3-(Methylthio)propionaldehyde 


Methional 


N,N-Dimethylmethanamine 


Trimethylamine 


3-Phenyl-2-propenal 


Cinnamaldehyde 


1 -Phenyl-3-methyl-3-pentanol 


Phenylethyl methyl ethyl carbinol 


2-Phenylethanol 


beta-Phenyl ethyl alcohol 


phenylisocyanide 


Phenylisocyanide 


2-Propene-l-sulfinothioic acid S-2-propenyl ester 


Allicin 


Pyridine 


Pyridine 


1-Pyrroline 




Triiodomethane 


Iodoform 


4-(2,6,6-Trimethyl-2-cyclohexen- 1 -yl)-3-buten-2- 
one and 4-(2,6,6-trimethyl-l-cyclohexen-l-yl)-3- 
buten-2-one 


alpha/beta Ionone 


w-Pentadecalactone 


w-Pentadecalactone 


Acetic acid; Ethanoic acid; 


Acetoin; 3-Hydroxy-2-butanone; 


Acetone; 2-Propanone; Propan-2-one; Dimethyl ketone; 


Acetophenone; Methyl phenyl ketone; 


Acetylpyrazine; 2-Acetylpyrazine; 


Acetylpyridine; 2-Acetylpyridine; 


Acetylpyrrole; 2-Acetylpyrrole; Methyl pyrrolyl ketone; 


Acetylthiazole; 2-Acetylthiazole; 
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Allyl sulfide; Diallyl sulfide; 



Ambrox®; (+)-Ambrox isomer; 



Ambrox®; (-)-Ambrox isomer; Ambroxan; 



Ambrox®; DL-Ambrox isomer; Synambran; 



Amyl alcohol; 1-Pentanol; Pentyl alcohol; 



Amyl butyrate; n-Pentyl butanoate; Amyl butanoate; 



Anisole; Methoxybenzene; 



Benzaldehyde; 



Benzenethiol; Thiophenol; Phenyl mercaptan; 



Benzothiazole; 



Benzyl alcohol; 



Bomyl acetate; 



Butanethiol; 1-Butanethiol; 



Butanone; 2-Butanone; Methyl ethyl ketone; 



Butyl acetate; n-Butyl acetate; 



Butyl alcohol; Butanol; 1-Butanol; n-Butanol; 



Butyl butyrate; Butyl butanoate; 



Butyl hexanoate; Butyl caproate; 



Butyl isobutyrate; n-Butyl 2-methylpropanoate; 



Butyl methylbutyrate; n-Butyl 2-methylbutyrate; 



Butyl propionate; n-Butyl propanoate; 



Butylamine; 1-Aminobutane; 



Butyraldehyde; Butanal; n-Butanal; 



Butyric acid; n-Butanoic acid; 
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Carvone; data for (-)-carvone 



Caryophyllene; beta-Caryophyllene; 



Citral; Geranial isomer 



Citral; Neral isomer 



Citronellol; (+)-Citronellol isomer 



Cresol; 2-Methylphenol; o-Cresol; 



Cresol; 3-Methyiphenol; m-Cresol; 



Cresol; 4-methylphenol; p-Cresol; 



Cyclocitral; beta-Cyclocitral isomer 



Damascenone; beta- Damascenone; 



Damascone; alpha- Damascone; 



Damascone; (+)-alpha-Damascone isomer; 



Damascone; (-)-alpha-Damascone isomer; 



Decadienal; trans,trans-2,4-Decadienal; 



Decalactone; gamma-Decalactone; 4-Decanolide; 



Decalactone; delta-Decalactone; 5-Decanolide; 



Decanal; Aldehyde C-10; Decyl aldehyde; 



Decanoic acid; Capric acid; 



Decenal; 2-Decenal; (E)-2-Decenal; 



Diacetyl; 2,3-dioxobutane; 



Dimethoxyphenol; 2,6-Dimethoxyphenol; Syringol; 



Dimethyl disulfide; Methyl disulfide; 



Dimethyl trisulfide; Methyl trisulfide; 



3,4-Dimethyl- 1 ,2-cyclopentanedione; 
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3, 5 -Dimethyl- 1 ,2-cyclopentanedione; 
2,5-Dimethyl-4-methoxy-3(2H)-furanone 

Dimethylpyrazine; 2,3-Dimethylpyrazine; j 
Dimethylpyrazine; 2,5 -Dimethylpyrazine; j 
Dimethylpyrazine; 2,6-Dimethylpyrazine; j 

i 

Dimethylthiazole; 4,5-Dimethylthiazole; 

Dimethyltrithiolane; 3,5-Dimethyl-l,2,4-trithiolane, 

Dodecalactone; gamma-Dodecalactone; 4-Dodecanolide; 

Dodecalactone; delta- Dodecalactone; 5-Dodecanolide; 

Dodecanal; Laurie aldehyde; Aldehyde C-12; Dodecyl aldehyde; 

Dodecanoic acid, Laurie acid; j 

Ethoxymethylpyrazine; 2-Ethoxy-3-methylpyrazine; 

Ethyl acetate; 

Ethyl acetoacetate; Acetoacetic acid, ethyl ester; 
Ethyl acrylate; 
Ethyl alcohol; 
Ethyl benzoate; 

Ethyl butyrate; Ethyl butanoate; 

Ethyl cinnamate; 

Ethyl heptanoate; 

Ethyl hexanoate; Ethyl caproate; 

Ethyl isobutyrate; Ethyl 2-methylpropanoate; 

Ethyl lactate; 

Ethyl methylbutyrate; Ethyl 2-methylbutyrate; 
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Ethyl 3-methylthiopropionate; 



Ethyl palmitate; Ethyl hexadecanoate; Ethyl cetylate; 



Ethyl phenylacetate; 



Ethyl propionate; Ethyl propanoate; 



Ethyl valerate; Ethyl pentanoate; 



Ethyl vanillin; 3-Ethoxy-4-hydroxybenzaldehyde; Ethavan; 
2-Ethyl-3,5-dimethylpyrazine; 



2-Ethyl-3,6-dimethylpyrazine; 



Ethyldimethylpyrazine; 3 -Ethyl-2,6-dimethylpyrazine; 



Ethylguaiacol; 4-Ethylguaiacol; 4-Ethyl-2-methoxyphenol; 



Ethylhexanol; 2-Ethyl-l-hexanol; 2-Ethylhexan-l-ol; 



Ethylhydroxymethylfuranone; 2-Ethyi-4-hydroxy-5-methyl-3(2H)furanone; 



Ethyl methoxypyrazine; 2-Ethyl-3-methoxypyrazine; 
Ethyl methylpyrazine; 2-Ethyl-5-methylpyrazine; 



Ethyl methylpyrazine; 3-Fthyl-2-methylpyrazine; 



Ethyl methylpyridine; 5-Ethyl-2-methylpyridine; 



Ethylpyrazine; 2-Ethylpyrazine; 



Eucalyptol; Cineole; 1,8-Cineole; 1,8-epoxy-p-menthane; 



Eugenol; 4-AUyl-2-methoxyphenol; 



Eugenyl methyl ether; Methyl eugenol; Methyl eugenol ether; 



Famesol; 



Formic acid; 



Furfural; 



Furfuryl alcohol; 



WO 03/091388 



PCT/IL03/00336 



29 



Furfuryl mercaptan; 



Furyl methyl ketone; 2-Furyl methyl ketone; 2-Acetylfiiran; 



Geraniol; trans-3,7- Dimethyl-2,7-octadien-ol; 



Geranyl acetate; 



Geranyl acetone; 6,10-Dimethyl-5,9-undecadien-2-one; 



Geranyl isobutyrate; Geranyl 2-methylpropanoate; 



Geranyl propionate; Geranyl propanoate; 



| Glycerol; Glycerin; 

II - — — - _ . — 

• Glycine; Aminoacetic acid; 

i 

! Guaiacol; o-Methoxyphenol; o-Hydroxyanisole, 



Heptalactone; gamma-Heptalactone; 4-heptanolide; 
| Heptanal; Aldehyde C-7; Heptaldehyde; Heptyl aldehyde; 



; Heptanoic acid; 

: Heptanone; 2-Heptanone; Methyl amyl ketone; 



; Heptenal, 4-Heptenal (cis and trans); 

|| 

i Heptenal; trans-2 -Heptenal; 



Heptenal; (E)-4-Heptenal; trans-4-Heptenal; 



Heptenal; (Z)-4-Heptenal; cis-4-Heptenal; 



Heptenone; 3-Hepten-2-one; (E)-3-Hepten-2-one 



Heptyl alcohol; 1-Heptanol; n-Heptanol; Alcohol C-7; 



Heptyl isobutyrate; Heptyl 2-methylpropanoate; 



Hexadienal; (E,E)-2,4-Hexadienal; trans,trans-2,4-Hexadienal; 



Hexalactone; gamma-Hexalactone; 4-Hexanolide; Hexan-4-olide; 



Hexanal; Aldehyde C-6; Caproic aldehyde; 
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; Hexanoic acid; Caproic acid; 

: Hexanol; 1-Hexanol; Hexyl alcohol; Caproic alcohol; Alcohol C-6; 

Hexenal; 2-hexenal; Hex-2-enal; (E)-2-hexenal; 

Hexenal; cis-3 -Hexenal; (Z)-3 -hexenal; 
I Hexenol; 3-Hexen-l-ol; cis-3-Hexenol;(Z)-3-Hexenol; 
| Hexyl acetate; 

li — — - ■ — - — 

i Hexyl butyrate; Hexyl butanoate; 

^ _ __. _ _ * 

j Hexyl isobutyrate; Hexyl 2-methylpropanoate; 

i Hexyl methylbutanoate; Hexyl 2-methylbutanoate; 

j Hexyl propionate; Hexyl propanoate; 

I 

! Hydroxydecadienoic acid lactone; 6-Pentyl-alpha-pyrone, 

i\ ■ — — . — * — _ — ■ " - 

I Hydroxydihydrotheaspirane; 6-Hydroxydihydrotheaspirane; 

{indole; 

i 

i lonone; alpha-lonone; 

i 

i lonone; beta-lonone; 

jlsoamyl acetate; 3-Methylbutyl acetate; 

'isoamyl alcohol, 3-Methyl-l-butanol; Isopentyl alcohol; 

jlsobutyl acetate; 

jlsobutyl alcohol; 2-Methyl- I -propanol; 

jlsobutyl isobutyrate; 2-Methylpropyl 2-methylpropanoate; 
i °™~ ' ~~~~ ■ "~ ■ 

! Isobutyimethoxypyrazine; 2-Isobutyl-3-methoxypyrazine; 

j 

; Isobutylmethylpyrazine; 2-Isobutyl-3-methylpyrazine; 
Isobutylthiazole; 2-lsobutylthiazole; 



Isobutyraldehyde; 2-Methylpropanal; 
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Isobutyric acid; 2-Methylpropanoic acid; 



Isovaleric acid; 3-Methylbutanoic acid; 



Jasmine lactone; Dec-7-en-5-olide; 



Limonene; d-Limonene; 



Linalool; 



■ Maltol; Veltol ; Corps praline; 



; 5-Ethyl-3-hydroxy-4-methyl-2(5H)-fiiranone; Maple furanone; 
Menthenethiol; l-p-Menthene-8-thiol; 



iMenthone; p-Menthan-3-one; 
2-Methoxy-3-isopropylpyrazine; 



; 2-Methoxy-5-isopropylpyrazine; 



:Methoxymethylphenol; 2-Methoxy-4-methylphenol; Creosol; 



; 2-Methoxy-3-sec-butylpyrazine; 
;2-,5 or 6-Methoxy-3-methylpyrazine; 



1 2-Methoxy-3-methylpyrazine; 
; 5-Methoxy-2-methylpyrazine; 
Methoxypyrazine; 2-Methoxypyrazine; 



Methoxyvinylphenol; 2-Methoxy-4-vinylphenol; 4-Vinylguaiacol; 



Methyl butyrate; Methyl butanoate; 



Methyl 2-furylmethyl disulfide; 



Methyl heptanoate; 



Methyl hexanoate, Methyl caproate; 



Methyl isobutyrate; Methyl 2-methyl propanoate; 



Methyl mercaptan; 
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Methyl methylbutyrate; Methyl 2-methylbutyrate; 



Methyl 3-methylthiopropionate; 



Methyl octanoate; Methyl caprylate; 



Methyl 1-propenyl disulfide; 



Methyl salicylate; 



Methyl sulfide; Dimethyl sulfide; Methylthiomethane; 



! Methyl valerate; Methyl pentanoate; 



4-Methylacetophenone; p-Methylacetophenone; 



Methylbutyl acetate, 2-Methylbutyl acetate; 



Methylbutyraldehyde; 2-Methylbutyraldehyde; 



! Methylbutyraldehyde; 3 -Methylbutyraldehyde; Isovaleraldehyde; 
|Methylbutyric acid; 2-Methylbutyric acid; 



;Methylcyclopentenolone; Cyclotene; Ketonarome; Corylone; MCP; 

i| — 
; 2-Methyl-3-(furfurylthio)pyrazine 



: 2-Methyl-5-(furfurylthio)pyrazine 



Methyl heptadienone; 6-Methyl-3,5-heptadien-2-one; 



Methylhep tenol ; 6-Methyl- 5 -hepten-2-ol ; 



' Methylheptenone; 6-Methyl-5-hepten-2-one; 
' 2-Methyl-4-propyl-l ,3-oxathiane; 



(+)-cis-2-Methyl-4-propyl- 1 ,3-oxathiane; 



(-)-cis-2-Methyl-4-propyl-l,3-oxathiane; 



Methylpyrazine, 2-Methylpyrazine; 



4-Methyl-5-thiazoleethanol; Sulfurol; 



Methylthioacetaldehyde; 2-Methylthioacetaldehyde; 
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Methylthiomethylpyrazine (mixture of isomers); 



2-Methylthio-3-methylpyrazine; 



5-Methylthio-2-methylpyrazine; 



Methylthiophencarboxaldehyde; 2-Fonnyl-5-methylthiophene; 



Methylthiopropanal; 3-(Methylthio)-propanal; Methional; 



Myrcene; 



Myristaldehyde; Tetradecanal; Aldehyde C-14 (Myristic); 



Myristic acid; Tetradecanoic acid; 



Nerol; 



Nonadienal; (E,Z)-2,6-Nonadienal; trans,cis-2,6-Nonadienal; 



Nonadienal; (E,E)-2,4-Nonadienal; trans,trans-2,4-Nonadienal; 



Nonanal; Nonyl aldehyde; Aldehyde C-9; 



Nonanoic acid; 



Nonanol; 1 -Nonanol; Nonyl alcohol; Alcohol C-9; 



Nonanone; 2-Nonanone; Methyl heptyl ketone; 



Nonenal; 2-Nonenal; 



Nonenal; cis-6-Nonenal; 



Nootkatone; (+)-Nootkatone (the natural isomer) 



Nootkatone; (-)-Nootkatone (the unnatural isomer) 



Octalactone; delta-Octalactone; 5-Octanolide; 



Octalactone; gamma-Octalactone; 4-octanolide, 



Octanal; Caprylic aldehyde; Aldehyde C-8; 



Octanoic acid; Caprylic acid; 



Octanol; 1-Octanol; Octyl alcohol,- Alcohol C-8; 
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Octanone; 2-Octanone; 



Octanone; 3-Octanone; 



Octenal; 2-Octenal; 



Octenol; l-Octen-3-ol; 



Octenone; l-Octen-3-one; 



Octyl acetate; 



Octyl isobutyrate; Octyl 2-methylpropanoate; 



Palmitic acid; Hexadecanoic acid; 



jPentadecalactone; omega-Pentadecalactone; 15-Pentadecanolide; 



Pentanone; 2-Pentanone; 



Pentenal; 2-Pentenal; 



Pentenol, l-Penten-3-ol; 



Pentenone, l-Penten-3-one; Ethyl vinyl ketone; 



Pentenone, 3-Penten-2-one; 



Pentylfuran; 2-Pentylfuran; 



Pentylpyridine; 2-Pentylpyridine, 



Phenethyl alcohol; 2-Phenethyl alcohol; 



Phenol; 



Phenylacetaldehyde; 



Phenylacetic acid; 



Pinene; alpha-Pinene; 



Pinene; beta-Pinene; 



Piperidine; 



Piperonal; Heliotropine; 
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.Propanediol; 1-Propanethiol; n-Propyl mercaptan; 



Propanol; 1-Propanol; Propyl alcohol; 



Propenyl propyl disulfide; Propyl propenyl disulfide; 



Propeny 1 guaethol ; 2-Ethoxy- 5 -propenylphenol ; 



Propionaldehyde; Propanal; 



'Propionic acid; Propanoic acid; 

; Propyl butyrate; Propyl butanoate; 

! Propyl propionate; Propyl propanoate; 

jPyrazinyl methyl sulfide; 2-(Methylthiomethyl)-pyrazine; 



i Pyridine; 

i 

i Pyrrole; 



{Pyrrolidine; 

I . — 

j Quinoline; 



! Raspberry Ketone; 4-(p-Hydroxyphenyl)-2-butanone; Oxanone, 
|Rose oxide; 4-Methyl-2-(2-methylpropen-l-yl)-tetrahydropyran; 
jSinensal; alpha-Sinensal; 



i Sotolon; Caramel furanone; 



! Stearic acid; Octadecanoic acid; 



Strawberry furanone; 4,5-Dimethyl-3-hydroxy-2(5H)-furanone; 



Styrene; Vinylbenzene; 



Terpineol; alpha-Terpineol; p-Menth-l-en-8-ol; 



' Terpinolene; p-Menth- 1 ,4(8)-diene; 



Tetramethylpyrazine, 2,3,5,6-Tetramethylpyrazine; 



j Thiamine hydrochloride; (value for thiamine pure) 
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Thymol, 5-Methyl-2-isopropylphenol; 



Trimethylamine; 



2,2,6-Trimethylcyclohexanone, 



Trimethyloxazoline; 2,4,5-Trimethyl-3-oxazoline; 



Trimethyl pyrazine; 2,3,5-Trimethylpyrazine, 



Trimethylthiazole; 2,4,5-Trimethylthiazole; 



Undecalactone; delta-Undecalactone; 5-Undecanolide; 



'Undecanal; Aldehyde C-ll (undecylic); 



Undecanoic acid; Undecylic acid; 



Unclecanone; 2-Undecanone; Methyl nonyl ketone; 



Valeraldehyde; Pentanal; 



Valeric acid; Pentanoic acid; 



Vanillin, 4-Hydroxy-3-methoxybenzaldehyde; 



Vinylphenol; 4-Vinylphenol; 



Once a correlation between an individuals (or population) genotype and 
phenotype is established, such a correlation can be used to match fragrance and flavors 
to specific consumers, consumer groups, subpopulations and populations 
5 (pharmacogenomics of olfaction). Moreover, identification of specific receptor- 
odorant interactions can offer ways to more efficiently design and deliver pleasing 
scents and flavors. Technologies for modulating odor response can utilize information 
uncovered using the present methodology to alter the olfactory response by molecular 
means (receptors agonists and antagonists) or to block or enhance the perception of 
10 specific smells. 

In addition to being usefiill in olfactory typing, the OR alleles of present 
invention and information derived therefrom can also be utilized in fertility testing. 

Recent studies have shown that several disctinct ORs are also predominantly or 
exclusively expressed in human spermatogenic cells [Parmentier (1992) Nature 
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355:453; Vanderhaeghen (1993) J. Cell Biol. 123:1441]. Immunocytochemistry 
indicates that resepctor proteins are localized to the sperm flagellar midpiece 
[Vanderhaeghen (1993) J. Cell Biol. 123:1441]. These observations have led to 
speculation that ORs may also function in chemosensory signaling pathways and 
5 hence in direct sperm chemotaxis. 

Recently Spehr and co-workers have cloned and functionally expressed a 
testicular OR, termed hOR17-4. Spehr and his colleagues exposed hOR17-4 to a 
number of different chemicals to determine which activated this protein. This study 
uncovered that in the presence of bourgeonal, some human sperm became activated 
10 and began to move toward the source of the chemical, indicating that ORs may also 
govern sperm movement towards the egg. Spehr noted that researchers have 
discovered between 20 and 40 olfactory receptors which, like hOR17-4, are localized 
to testicular tissue. 

In view of the above findings, it is highly likely that the olignucleotides and/or 
15 antibodies of the present invention can be used to detect/diagnose male infertility, 
particularly that associated with deficiency in sperm motility or to detect specific 
odorants which may suppress or enahnce sperm motility. Only recently has it been 
shown that sperm motility parameters are important for both presenting the maximum 
number of male gametes to the egg as well as facilitating penetration through its zona 
20 pellucida. Sperm motility parameters were also found to have high correlation with 
fertilization rates in vitro [Mahadevan and Trounson (1984) Fertil. Steril. 24: 131-4]. 

Detecting or diagnosing male infertility is typically effected using a sperm 
sample (Le. 9 semen) obtained from the subject tested. Semen can be collected by any 
method which is generally used for that species. For example, bovine and rabbit 
25 semen is typically collected by use of an artificial vagina. Human semen is typically 
collected by manual ejaculation. Methods of extracting proteins or nucleic acids from 
biological samples and methods of probing such samples with the oligonucleotides 
and/or antibodies of the present invention are described hereinabove. 

To facilitate diagnosis the oligonucleotides and antibodies generated according 
30 to the teachings of the present invention can be included in a diagnostic kit. These 
reagents can be packaged in a one or more containers with appropriate b uffers and 
preservatives and used for diagnosis. 
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Preferably, the containers include a label. Suitable containers include, for 
example, bottles, vials, syringes, and test tubes. The containers may be formed from a 
variety of materials such as glass or plastic. 

In addition, other additives such as stabilizers, buffers, blockers and the like 
5 may also be added. 

Additional objects, advantages, and novel features of the present invention will 
become apparent to one ordinarily skilled in the art upon examination of the following 
examples, w hich a re n ot i ntended t o b e 1 imiting. A dditionally, e ach o f the v arious 
10 embodiments and aspects of the present invention as delineated hereinabove and as 
claimed in the claims section below finds experimental support in the following 
examples. 

EXAMPLES 

15 Reference is now made to the following examples, which together with the 

above descriptions, illustrate the invention in a non limiting fashion. 

Generally, the nomenclature used herein and the laboratory procedures utilized 
in the present invention include molecular, biochemical, microbiological and 
recombinant DNA techniques. Such techniques are thoroughly explained in the 

20 literature. See, for example, "Molecular Cloning: A laboratory Manual" Sambrook et 
ah, (1989); "Current Protocols in Molecular Biology" Volumes I-m Ausubel, R. M., 
ed. (1994); Ausubel et al., "Current Protocols in Molecular Biology", John Wiley and 
Sons, Baltimore, Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", 
John Wiley & Sons, New York (1988); Watson et al., "Recombinant DNA", Scientific 

25 American Books, New York; Bin-en et al (eds) "Genome Analysis: A Laboratory 
Manual Series", Vols. 1-4, Cold Spring Harbor Laboratory Press, New York (1998); 
methodologies as set forth in U.S. Pat. Nos. 4,666,828; 4,683,202; 4,801,531; 
5,192,659 and 5,272,057; "Cell Biology: A Laboratory Handbook", Volumes I-HI 
Cellis, J. E., ed. (1994); "Culture of Animal Cells - A Manual of Basic Technique" by 

30 Freshney, Wiley-Liss, N. Y. (1994), Third Edition; "Current Protocols in 
Immunology" Volumes Mil Coligan J. E., ed. (1994); Stites et al (eds), "Basic and 
Clinical Immunology" (8th Edition), Appleton & Lange, Norwalk, CT (1994); Mishell 
and Shiigi (eds), "Selected Methods in Cellular Immunology", W. H. Freeman and 
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Co., New York (1980); available immunoassays are extensively described in the patent 
and scientific literature, see, for example, U.S. Pat. Nos. 3,791,932; 3,839,153; 
3,850,752; 3,850,578; 3,853,987; 3,867,517; 3,879,262; 3,901,654; 3,935,074; 
3,984,533; 3,996,345; 4,034,074; 4,098,876; 4,879,219; 5,011,771 and 5,281,521; 
5 "Oligonucleotide Synthesis" Gait, M. J., ed. (1984); "Nucleic Acid Hybridization" 
Hames, B. D., and Higgins S. J., eds. (1985); "Transcription and Translation" Haines, 
B. D., and Higgins S. J., eds. (1984); "Animal Cell Culture" Freshney, R. L, ed. 
(1986); "Immobilized Cells and Enzymes" IRL Press, (1986); "A Practical Guide to 
Molecular Cloning" Perbal, B., (1984) and "Methods in Enzymology" Vol. 1-317, 

10 Academic Press; "PCR Protocols: A Guide To Methods And Applications", Academic 
Press, San Diego, CA (1990); Marshak et al., "Strategies for Protein Purification and 
Characterization - A Laboratory Course Manual" CSHL Press (1996); "Approaches to 
Gene Mapping in C omplex Human D iseases" Jonathan L. Haines and Margaret A. 
Pericak-Vance eds., Wiley-Liss (1998); "Genetic Dissection of Complex Traits" D.C. 

15 Rao and Michael A. Province eds., Academic Press (1999); "Introduction to 
Quantitative Genetics" D.S. Falconer and Trudy F.C. Mackay, Addison Wesley 
Longman Limited (1996); all of which are i ncorporated b y reference as if fully set 
forth herein. Other general references are provided throughout this document. The 
procedures therein are believed to be well known in the art and are provided for the 

20 convenience of the reader. All the information contained therein is incorporated 
herein by reference. 



General Materials and Methods 
Population samples and DNA sequencing - 12 olfactory receptor (OR) coding 
25 regions and 3 OR introns (-1000 bp each) scattered along -400 Kb of the OR gene 
cluster (Figure 1) on human chromosome 17pl3.3 (15 Kb in total) were subjected to 
PCR amplification from 35 unrelated individuals consisting of 4 different 
ethnogeographic populations: 10 Ashkenazi Jews, 10 Yemenite Jews, 8 Bedouins, and 
7 Pygmies. 

30 PCR amplification - PCR reactions were carried out in a volume of 2 5 \x\, 

containing 0.2 |xM of each deoxynucleotide (Promega Corp., Madison, WI, USA), 50 
pMol of each primer, PCR buffer containing 1.5 DM MgCl 2 , 50 DM KC1, 10 DM 
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Tris-HCl pH 8.3, one unit of Taq DNA polymerase (Boehringer Mannheim, Germany) 
and 50 ng of genomic DNA. PCR reactions included an initial denaturation step of 3 
minutes at 94 °C, followed by 35 cycles of denaturation (1 minute at 94 °C), annealing 
(1 minute) at either 55 °C or 60 °C, extension (1 minute at 72 °C) and a final extension 
5 step of 10 minutes at 72 °C. PCR products were subjected to 1 % agarose gel 
electrophoresis, and were further purified using the High Pure PCR Product 
Purification Kit (Boehringer Mannheim, Germany). 

Sequencing analysis - Purified PCR products were subjected to dye- terminator 
cycle sequencing reactions (Perkin Elmer, Boston, Massachusetts, USA). Extension 

10 reactions were electrophoresed on an ABI 3700 automated DNA sequencer (Applied 
Biosystems, Foster City, CA USA). After base calling with the ABI Analysis 
Software (version 3.0), the analyzed data was edited using the Sequencher program 
(version 4.0, GeneCodes Corp., Ann Arbor, Michigan, USA). 

For each individual, a genomic segment of an approximately 1 kb which was 

15 sequenced from both ends was assembled using the Sequencher software to identify 
DNA polymorphisms. Sequencing was repeated for each genomic segment containing 
a singleton. 

Statistical analysis - Summary of the nucleotide variability was calculated for 
each ethnogeographic group using the Watterson's theta (Watterson, 1975), which 

20 is based on the number of segregating sites in the sample, and the nucleotide diversity 
7i (Nei and Li, 1979), which is the average number of differences between all pairs of 
sequences in the sample. The Tajima's Z)-test (Tajima, 1993) was used to estimate 
whether the frequency spectrum of alleles deviates significantly from the expectations 
of a standard neutral model. Tajima's D is positive in case of an excess of 

25 intermediate frequency alleles, and negative in case of an excess of rare alleles. 
Positive Tajima's D values may be caused by recent bottlenecks or balancing selection, 
while negative Tajima's D values may be caused by recent selective sweep, purifying 
selection or population expansion. 

Haplotype inference - The sequencing data reflected individuals heterozygotes 

30 for multiple DNA polymorphisms with ambiguous haplotype structure. To resolve the 
haplotype structure of each sample the Clark's haplotype subtraction algorithm (Clark, 
1990) was employed. The rationale of this algorithm is that homozygous haplotypes 
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are probably common and that a double heterozygote is likely to contain known 
common haplotypes. The Clark's algorithm is composed of three steps: 1) 
Identification of all unambiguous haplotypes (all homozygous and sequences with one 
heterozygous site) and considering them as 'resolved 1 . 2) Determination of whether 
5 each of the resolved haplotypes could be one of the alleles in the remaining ambiguous 
sequences. 3) In each case a possible phase of a double heterozygote is identified as 
one of the resolved ones, the phase is assumed to be known, and the remaining 
haplotype is added to the resolved haplotype set. This algorithm has been previously 
used and, in particular cases, has proven to be reliable by comparison with haplotypes 

10 obtained by direct molecular methods (Clark et al., 1 998; Rieder et al., 1999). 

To reveal the population substructure and patterns of linkage disequilibrium 
(LD), all rare variants (< 0.15) were excluded from the data before applying the 
algorithm. Then the algorithm was applied separately to the three major parts of the 
cluster (see Figure 1), allowing 6-12 overlapping SNPs between each pair of segments. 

15 Once all the haplotypes were resolved, the full-length haplotype was constructed using 
the overlapping SNPs and the subsets of the heterozygous sites were compared with 
the experimentally determined haplotype information of Gilad et al. (2000). 

Population stratification - The nearest neighbor statistic (S n n, Hudson, 2000) 
was used to test for population substructure. This method is a measure of how often a 

20 pair of nearest haplotypes (based on sequence similarity) belongs to the same 
ethnogeographical population group. The S nn value approaches unity when the 
populations at the two localities are highly differentiated and is 0.5 when the 
populations are part of the same panmictic population (Hudson, 2000). A permutation 
test is used to assess whether the S nn is significantly large for a particular sample, 

25 indicating that the populations at the two localities are differentiated. For genotype 
data in a small number of individuals with extensive recombination, this method was 
shown to perform better than alternative ones (Hudson, 2000). The commonly used F st 
statistic (Wright, 1951) was also calculated for all pairwise population groups. 

LD and recombination - The coefficient D' (Lewontin, 1964) was used as a 

30 measure of LD between polymorphic sites, using the Graphical Overview of Linkage 
Disequilibrium (GOLD) software (Abecasis and Cookson, 2000), applying a Fisher 
exact test (FET) for statistical significance. An alternative method used for pairwise 
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LD computation was the Expectation Maximization (EM) algorithm (Excoffier, 1995) 
using the Arlequin software (http://lgb.unige.ch/arlequin). 

EXAMPLE 1 

5 Population - specific nucleotide diversity within the human olfactory receptor (OR) 

gene cluster 

The olfactory receptor gene cluster on human chromosome 17pl3.3 was 
studied using SNP analysis in four distinct ethnogeographic populations: Ashkenazi 
Jews, Yemenite Jews, Bedouins and Pygmies. 

10 Experimental and statistical results 

Identification of SNPs within the OR gene cluster - SNP scoring was 
performed by sequencing of 12 OR coding regions and segments within three OR 
introns of 35 unrelated individuals from four disparate ethnogeographic origins: 10 
Ashkenazi Jews, 10 Yemenite Jews, 8 Bedouins, and 7 Pygmies. Genotyping was 

15 employed for each individual along the entire OR cluster. 

A total of 74 polymorphic sites were found, of which 31 were novel 
(http://bioinfo.weizmann.ac.il/~menashe/ORl 7JSNPs.html). Two of the SNPs 
identified, in pseudogenes OR1P1P and OR1E3P segregated between the 
pseudogenized and intact forms. Noteworthy, the variability within the same two ORs 

20 displayed significant deviations from Hardy- Weinberg equilibrium, whereby 7 out of 
10 SNPs (singletons excluded) were not at equilibrium (Table 2). 



Table 2 - Sequence diversity and Neutrality test for the OR gene cluster on 

Chromosome 17pl3.3 





n° 


5* 


Singletons 


n(%) 


0„(%) 


Tajima'sD 


P value (of 
Tajima D) 


Total 


35 


74 


19 


0.12 


0.10 


0.42 


>0.1 


Pygmies 


7 


57 


8 


0.14 


0.12 


0.88 


>0.1 , 


Bedouin 


8 


55 


6 


0.12 


0.12 


0.12 


>0.1 


Yemenite Jews 


10 


' 46 


4 


0.10 


0.09 


0.51 


>0.1 


Ashkenazi Jews 


10 


46 


1 


0.08 


0.08 


-0.32 


>0.1 


Average 


8.75 


51 


4.75 


0.11 


0.10 


0.30 




SD 


1.50 


5.83 


2.99 


0.03 


0.02 


0.52 





25 Population variability values are given for the total data set and for each population group separately, n 
is the nucleotide diversity, G is the population mutation rate. P values for Tajima's D are calculated 
using DnaSP v. 3.12. n a = sample size. S b = number of SNPs. 
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Nucleotide diversity - Distinct differences were seen among the human 
populations. The highest nucleotide diversity was found in the Pygmy population (n = 
0.14 %, Table 2), while the lowest v alue w as in the A shkenazi J ews (tt = 0.08 %, 
Table 1). Consequently the singletons were unequally distributed, with as many as 8 
5 in the African Pygmies and only 1 in the Ashkenazi Jews. These values are consistent 
with a historically small population size for Ashkenazi Jews and with the previously 
reported high variability in Africans (Hammer et al., 2000; Kobyliansky et al., 1982; 
Przeworski et al., 2000). 

SNPs frequencies do not deviate from the neutral model - The overall values 

10 of By, =15.2 (0.10 % per bp) and n = 0.12 % were found to be within the range 
previously reported for the presently studied cluster (Gilad et al., 2000) and elsewhere 
(Clark et al., 1998; Fullerton et al., 2000; Subrahmanyan et al., 2001). The overall 0 W 
was not significantly different from 19, the observed number of SNPs seen only once 
in the sample (singletons) (T-test, P = 0.36), as predicted by neutral expectations. 

15 The Tajima f s D statistic (Tajima, 1989) was computed to compare the observed 

frequency spectrum of SNPs to neutral model expectations (Table 2). The values for 
the entire data set, as well as for the individual populations, did not represent a 
statistically significant deviation from neutrality (P values > 0.1 in all cases, Table 2) 
although the Tajima's D values of the Pygmy population (£> = 0.88) and the Ashkenazi 

20 Jewish population (Z) = -0.32) present two extremes. 

Altogether, these data uncovered 31 new SNPs and confirmed 43 known SNPs 
within the OR gene cluster on chromosome 17pl3.3. These findings demonstrate 
distinct nucleotide diversities between the four different ethnogeographic groups with 
the highest value found in the African Pygmy population and the lowest value found in 

25 the Ashkenazi Jewish population. However, the observed frequency spectrum of 
SNPs did not represent a statistically significant deviation from neutrality. These 
results demonstrate a highly documented SNP mapping within the OR gene cluster on 
chromosome 17pl3.3 which can be used for haplotype analysis and linkage 
disequilibrium determination. 

30 
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EXAMPLE 2 

Linkage disequilibrium along the olfactory receptor (OR) gene cluster on 

chromosome 1 7pl3.3 
To calculate the level of linkage disequilibribium (LD) along the OR gene 
5 cluster within the four ethnogeographic groups the various haplotypes and 
recombination events were calculated. 

Experimental and statistic results 

Haplotype reconstruction - To calculate the LD, 40 polymorphic sites with 
intermediate frequencies (higher than 0.15) were subjected to haplotype re- 

10 construction using the Clark's algorithm as described in Methods hereinabove. Using 
this algorithm, 47 haplotypes from 30 individuals were successfully elucidated (Figure 
2). However, the algorithm failed to resolve the haplotypes of five individuals (14 %) 
due to ambiguities in their genotypes. However, this un-resolved haplotype fraction is 
not unexpected for the sample size used (Clark, 1990). Moreover, since no excess of 

15 heterozygous genotypes was seen 

(http://bioinfo.weizmann.ac.il/-menashe/OR17 SNPs.htmD these results probably do 
not reflect non-specific PCR amplifications of two highly similar ORs. 

Assuming neutrality, where haplotype frequency is governed only by genetic 
drift, and no recombination, the expected mean number of haplotypes for the observed 

20 variability values is 24 (Ewens, 1982). Therefore, the observed 47 haplotypes within 
30 individuals may reflect multiple recombination events in this gene cluster. Using 
the DnaSP package (Rozas and Rozas, 1995), the minimal number of recombination 
events (R m ) (Hudson and Kaplan, 1985) was calculated, assuming no recurrent 
mutations, and was found to be 18. 

25 Population subdivision - The haplotype distribution of the whole sample 

revealed a notable differentiation between the four ethnogeographic groups. As is 
shown in Figure 2, while most of the haplotypes composed of rare variants are found 
within the Pygmy population, the majority of segregating pseudogenes with increasing 
disruption level are found in the Ashkenazi Jewish population. 

30 The level of genetic differentiation of the subpopulations was further evaluated 

by applying the nearest-neighbor statistic (S nn ) (Hudson, 2000) to the haplotype data. 
According to this statistic, the S nn is expected to be near one when the populations at 
the two localities are highly differentiated and near one half when the populations at 
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the two localities are part of the same panmictic population. As is shown in Table 3, 
the highest S nn value is observed when the Pygmy population is compared with the 
Ashkenazi Jewish population, reflecting that these two populations are well 
differentiated. On the other hand, the S nn values observed when the Bedouin 
5 population is compared with the two Jewish populations (i.e., the Yemenite Jews and 
the Ashkenazi Jews) indicate at this particular OR cluster that the two Jewish 
populations are genetically closer to the Bedouins than to each other. These findings 
appear to correlate with the respective geographic distances, since the Eastern 
Mediterranean Bedouins are intermediates between the European Ashkenazi and the 
10 South Arabian Yemenites. 



Table 3 - Population subdivision: S„„ test 





Yemenite Jews 


Bedouin 


Ashkenazi Jews 


Ail 


Pygmies 


0.77 (P<0.01) 


0.79 (P<0.01) 


0.90 (P< 0.01) 


0.86 (P<0.01) 




Yemenite Jews 


0.73 (P = 0.01) 


0.83 (P< 0.01) 


0.82 (P<0.01) 






Bedouin 


0.69 (P = 0.02) 


0.74 (P = 0.02) 








Ashkenazi Jews 


0.79 (P<0.01) 



The nearest neighbor statistic values are given for the total dataset (All) and for 
15 pairwise group comparisons with the corresponding p values. 

The population substructure was further evaluated using the F st statistic 
(Wright, 1951). As is shown in Table 4 hereinbelow, the four ethnogeographic 
populations did not exhibit a significant differentiation from each other. This is 
20 probably due to the fact that the F st test has low power when small populations with a 
substantial recombination rate are compared. However, as with the S nn test, the 
comparison of the Pygmy population with the Ashkenazi Jewish population displayed 
the highest F st value, demonstrating that these two populations are highly 
differentiated from each other. 

25 

Table 4 - Population subdivision: F st test 





Yemenite Jews 


Bedouin 


Ashkenazi Jews 


Ail 


Pygmies 


0.17 


0.12 


0.27 


0.19 




Yemenite Jews 


0.01 


0.03 


0.02 






Bedouin 


0.04 


O.OO 








Ashkenazi Jews 


0.08 
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The Fst values are given for the total dataset (All) and for pairwise group 
comparisons. 

Determination of linkage disequilibrium (LD) - Based on the inferred 
haplotype information, the level of linkage disequilibrium, D' (Lewontin, 1964) was 
5 calculated for all pairs of sites in the sample. As is shown in Figure 3, SNPs scattered 
along the entire three regions within the OR gene cluster were aligned according to 
their physical order and the level of LD between them was determined. Since the 
Clark's method tends to reduce the number of haplotypes and thus may inflate the 
estimate of LD, two different statistic tests were employed to assess the significance of 

10 the LD. T hese are the Fisher Exact Test (FET) and the Expectation Maximization 
(EM) algorithm (Excoffier, 1995). Both tests were applied directly to the genotype 
data, using the same frequency cutoff and the same 30 individuals. As is further 
shown in Figure 3, the similarity between the two algorithms was extremely high for 
the first two regions of the cluster (regions I and II), with respectively 94.3 % and 98.6 

15 % of the results found to be in agreement. In the third region, the concordance level 
was only 64.8 %. In this case, the haplotype-based method is preferred. The poor 
performance of the EM algorithm in this region i s probably due to deviations from 
Hardy-Weinberg equilibrium in the two OR pseudogenes in this region. The 
inferiority of the direct method in this region is revealed through discrepancies in 

20 intragenic haplotypes as previously determined using cloning of PCR products (Gilad 
et al., 2000). 

Determination of LD decay along the entire OR gene cluster on chromosome 
17pl3.3 - To evaluate the LD decay along the entire OR region the D' values were 
plotted against pairwise physical distance. As is shown in Figure 4a, the LD decay to 

25 D' = 0.5 for the entire dataset was seen at a distance of 124 Kb (R = -0.53, P < 0.01). 
However, since the entire dataset was composed of four highly differentiated 
populations, the LD decay was further analyzed and compared between the four 
ethnogeographic groups. These analyses revealed significant decays to £>' = 0.5 for 
the Pygmy population at a distance of 210 Kb (R = -0.24, P < 0.01, Figure 4b). On the 

30 other hand, the Bedouins and both Jewish populations exhibited high level of LD with 
no evidence of decay at distances of 320 Kb (R = -0.16, p < 0.01, Figure 4d), 400 Kb 
(R = -0.11, p = 0.08, Figure e) and 1.1 Mb (R = -0.01, P = 0.41, Figure 4c) for the 
Bedouins, Ashkenazi Jews and Yemenite Jews, respectively. The minimal 



WO 03/091388 



PCT7IL03/00336 



47 

recombination values varied roughly in accordance and were Rm = 12, 11, 5 and 6 for 
the Pygmies, Bedouins, Yemenite Jews and Ashkenazi Jews. In addition, the four 
ethnogeographic groups exhibited different distribution of LD across regions (data not 
shown). However, given the small sample sizes, these differences in spatial patterns 
5 of LD may only be taken as indicative and not statistically proven. 

Altogether these results demonstrate that the OR gene cluster on chromosome 
17pl3.3 exhibits a particularly slow decay of LD, whereby D' decreases to 0.5 at an 
average distance of 124 Kb, and no decay is observed when the Pygmies are omitted 
from the analysis. These results support previous findings demonstrating a 
10 significantly higher level of LD in non- African samples (Reich et al., 2001; Frisse et 
al., 2001). 

Therefore, these results constitute a solid basis for an association study 
between genetic variations within the OR gene cluster on human chromosome 17pl3.3 
and specific olfactory phenotypes. The conspicuous population substructure and the 
15 long stretches with significant LD found within these populations strongly suggest the 
use of only a few markers within this cluster for the study of this kind. 

EXAMPLE 3 

Sequencing of five olfactory receptor pseudogenes revealed that three of the 
20 analyzed OR pseudogenes segregate between intact and disrupted forms. 
Experimental and statistic results 

Segregating pseudogenes in the entire sample population - The sequence 
analysis included five OR pseudogenes. Two of which (OR1E3P and OR1P1P) have 
an o pen r eading frame i nterrupted a t o nly o ne p osition and leading t o a p otentially 

25 inactive olfactory receptor. The coding region of OR1E3P is interrupted by a single 
base deletion (nucleotide coordinate 54 of SEQ ID NO: 84; see, Tables 5 and 6) that 
causes a frame shift and results in a premature stop codon. The coding region of 
OR1P1P is interrupted by a nonsense mutation (T->A at nucleotide coordinate 553 of 
SEQ ID NO: 85; see, Tables 5 and 6). Sequencing of 35 individuals from the four 

30 ethnogeographic groups (Pygmies, Bedouins, Yemenite Jews and Ashkenazi Jews) 
revealed that these two mutations in OR coding regions were polymorphic in the entire 
sample. The single base deletion of OR1E3P (causing a truncated protein) was absent 
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in 12/70 (17 %) of the chromosomes, and the nonsense mutation in OR1P1P was not 
seen in 14/70 (20 %) of the chromosomes. 

A third pseudogene, OR3A1, has a single nucleotide substitution (G-»A at 
coordinate 374 of SEQ ID NO: 89; see, Tables 5 and 6) which yields an amino acid 

5 replacement (Arginine 125 to Glutamine) in the DRY motif, a highly conserved 
position in ORs and other G-protein-coupled receptors (GPCRs) (Probst et al., 1992). 
This residue has been suggested to play a crucial role in signaling-related 
conformational changes in 7-helix receptors (Alewijnse et al., 2000). Therefore, this 
mis-sense mutation (R125Q) may reflect a third pseudogene, potentially leading to 

10 receptor inactivation. The mis-sense mutation was found to segregate within the entire 
sample population whereas the intact R125 form present in 48 % of all chromosomes. 

Population - specific segregating pseudogenes - When the fraction of 
segregating pseudogenes was analyzed and compared between the different 
ethnogeographic groups, unequal disposition of all three segregating pseudogenes was 

15 found (Figure 2). In all cases, the intact form of these genes was significantly more 
frequent in individuals of Pygmy origin than in individuals from other origins (Figure 
2). Inspection of the haplotypes for these three SNPs revealed 7 out of the expected 8 
haplotypes. Four of the chromosomes contained the a 11-intact haplotype, including 
one homozygote pygmy individual. As is further shown in Figure 2, the more intact 

20 haplotypes (labeled in green) were found among the Pygmies, whereas, the more 
disrupted haplotypes (marked in red) were found in the Yemenite Jews, Bedouins and 
Ashkenazi Jews. The highest level of haplotypes disruption was found in the 
Ashkenazi Jews, where 84 % of the haplotypes were completely damaged (Figure 2). 

These results demonstrate three putative SNP-related functionally segregating 

25 pseudogenes. It is likely that a functional loss at a given OR locus results from a 
homozygous state of the pseudogene, Le. 9 that olfactory dysfunction is a recessive trait 
(Whissell-Buechy and Amoore, 1973). Thus, individuals carrying at least one intact 
gene variant may have an extended olfactory ability compared to individuals who 
carry two pseudogene variants in the same locus. Accordingly, ten of the thirty 

30 individuals in Figure 2 might be hyposmic or anosmic in relation to holding both 
disrupted alleles in all three segregating OR loci. At the other extreme, three members 
of the sample are expected to be functionally intact at all three loci. Other individuals 



WO 03/091388 




PCT/IL03/00336 



49 

constitute intermediate cases with one or two potential functional disruptions. These 
findings therefore suggest that different human individuals have different 
chemosensory repertoires. 

Altogether, these results further suggest the use of these OR pseudogenes in a 
5 genotype-phenotype correlation study. 

EXAMPLE 4 

Characterization of additional novel polymorphic olfactory receptor genes 
A large-scale SNP mapping of the olfactory receptor loci was performed in 189 
10 individuals from different ethnic origins. 
Materials and Methods 

DNA samples - Human genomic DNA was obtained from 28 unrelated 
anonymous individuals who donated blood at the Israeli Blood Bank, 30 unrelated 
individuals, provided by the National Laboratory for Genetics of Israeli Population at 

15 Tel Aviv University (10 Ashkenazi Jews, 10 Bedouins, 10 Yemenite Jews) and 131 
samples provided by the Coriell Cell Repositories, Camden, NJ. The latter consisted 
of 9 Aboriginal Taiwanese, 7 Russians, 5 Mexicans, 5 Brazilians, 6 Asians and 99 
African American. Genomic DNA from two chimpanzees (Pan troglodytes) was 
isolated from whole blood. 

20 OR loci and SNPs selection - A consensus sequence for each OR family was 

generated using an alignment of 1227 intact OR sequences from human and mouse. 
These sequences were used as queries in tBLASTN searches against the Celera's 
human refSNP database. Each SNP was associated with a location within a particular 
OR by requiring that it resides on the same chromosome and that at least 33 amino 

25 acids from both sides of the SNP match exactly to the gene. SNPs were selected only 
if they unambiguously assigned to one intact OR gene and changed the open reading 
frame. 

Sequencing and SNP genotyping - DNA sequencing was performed on PGR 
products as described in the General Materials and Methods for Examples 1-3 
30 hereinabove. SNP genotyping employed the high-throughput mass-spectrometry SNP 
scoring system (Sequenom, San Diego, CA, USA). The segregating OR loci were 
validated by repeating their genotyping in all individuals. 
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Experimental and Statistics Results 

Screening for potential segregating pseudogenes (SPGs) in the olfactory 
receptor (OR) loci - Potential SPGs were screened using two approaches. In the first 
approach, OR pseudogenes that contain only one open reading frame disruption were 

5 included (Glusman et al., 2001). Fifty of these ORs were sequenced in chimpanzee, in 
which 33 of 50 were found to be intact suggesting that these sequences recently 
emerged in humans and thus further proving that these sequences are SPGs. In the 
second approach, the Celera's human single nucleotide polymorphism (SNP) database 
was searched for variations with potential to affect protein integrity. These included 

10 in-frame stop codons and mis-sense mutations in highly conserved amino acids with a 
potential to modify the olfactory receptor function (Schoneberg et al., 2002). This led 
to the identification of another 18 candidate segregating pseudogenes. 

Identification of SPGs among 189 ethnically diverse individuals - A total of 
51 OR loci were genotyped from 189 ethnically diverse individuals as detailed in 

15 Materials and Methods hereinabove. Twenty six of these ORs (Table 5) were found to 
segregate between the intact and disrupted alleles in the entire sample set resulting in a 
unique genotypic pattern for each of the tested individuals. Figures 5a-b summarize 
the various genotypic patterns of the segregating pseudogenes within the African- 
American (Figure 5a) and the non- African (Figure 5 b) populations. Assuming that 

20 olfactory dysfunction is a recessive trait (Whissell-Buechy and Amoore, 1973), a large 
diversity in the putative phenotypes can be deduced from the different genotypes. 
Altogether, a total of 178 putative phenotypes can be seen in the 189 individuals 
studied. Such a high level of documented inter-individual variability in a gene family 
is unprecedented, except in the case of the major histocompatibility complex (Yeager 

25 and Hughes, 1999). 

Non-Africans have significantly fewer intact ORs than African-Americans - 
The frequency of twelve intact OR alleles was compared between the 
Caucasian and Pygmy populations (Figure 6). The frequency of the null allele for two 
SPGs was equal in the two populations (ORlE3p and OrqD6p). For nine out of the 

30 other 10 SPGs, the null allele was found to be at a higher frequency in Caucasians than 
in Pygmies (Figure 6, two-tailed sign test, P < 0.021). Altogether, the frequencies of 
the null alleles were 13 % higher in the Caucasian population than in the Pygmy 
population. 
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A further analysis incorporating all 26 SPGs of the present invention (listed in 
Table 5) revealed that the frequency of genomes having fewer numbers of intact OR 
(i.e. 9 having more disrupted pseudogenes) was higher among the non- African 
population (Figure 7). These results are consistent with previous results presented in 

5 Example 3 hereinabove, demonstrating a higher fraction of intact OR genes among the 
Pygmies as compared with the Bedouins, Yemenite Jews and Ashkenazi Jews. 

Computation of the number of OR SPGs in the entire genome - The number 
of OR SPGs in the entire human genome was computed for 189 individuals by a two- 
step procedure. In the first step, based on the discovery of 1 1 OR SPGs among 50 

10 singly disrupted pseudogenes tested, it was estimated that for the total of 67 such 
genes in the HORDE database (http://bioinformatics.weizmann.ac.il/HORDEA there 
would be a proportionate number of 15 SPGs genome-wide. The assumption was that 
OR pseudogenes with more than one disrupting mutation are unlikely to harbor SPGs. 
In the second step, based on a sequencing depth of five chromosomes in the Celera 

15 SNP database (Venter et al., 2001), and assuming a neutral frequency spectrum for the 
disrupted derived alleles, the SPG count of 1 5 in 5 chromosomes extrapolates to 45 
SPGs in 378 chromosomes (189 individuals). Thus, a total of 60 SPGs was computed 
(i.e., 15+45). Using the same formulation, the count of SPGs with a frequency higher 
than 1 % was found to be 48. 

20 These numbers are in rough agreement with the reported count of different 

modes of specific anosmias, human odorant- specific sensory deficits (Amoore, 1974). 

Thus, the genotypic differences presented hereinabove might underlie at least 
some of the reported human phenotypic olfactory variations. Therefore, these results 
demonstrate the potential use of the OR SPGs for future association studies between 

25 individual OR disruptions and defined cases of odorant-specific olfactory threshold 
variability. 

Table 6 below lists oligonucleotide sequences which can be used to establish 
an odorant genotype for individuals or populations thus putatively classifying such 
individuals or populations according to their odorant sensitivity. 



30 
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Table 5 - Frequency of wild type (intact) and polymorphic(disrupted) olfactory 
receptor (OR) alleles in African Americans and Non-Africans 
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INS 


ft A< 
U.Oj 


ft A 7 
U.*» / 


ft SA 
U.JO 


ft K 


ft «»7 
U.J J 


ft AA 


OR4K3 


14+17.6 


ODEL 


622 


DEL 


f\ CI 

U.J t 


ft A7 
U.O/ 


ft <ft 

U.JO 


U.4V 


A 71 
U.J J 


ft AO 


OR5G3 


1 1+61.7 


ODEL 


373 


DEL 


0.00 


A A/1 


ft AA 
U.OO 


A 77 


A 7A 

U.JO 


A 7A 
U.J^ 


OR SI Fl n 
W rv_) trip 


1 1+15.1 


ODEL 


274 


DEL 


ft 5Q 
U.jV 


ft SI 
U.oJ 


ft 7 

U. / 


ft ill 


ft 17 
U. 1 / 


ft 7 

U.J 


OR2Jlp 


6+32.9 


OT 


574 


STOP 


ft ft? 
U.&Z 


ft SQ 

U.J7 


ft 71 
U. / I 


ft 1ft 


ft 4.1 
U.f 1 


ft 70 


OR52B4 

V/IVJ 4LTT 


1 1+15.1 


ODEL 


1 19 


DEL 


0.82 


0.63 


! 0.73 


0.18 


0.37 


0.27 


OR1B1 


9+106 


OT 


574 


STOP 


0.89 


0.59 


0.74 


0.11 


0.41 


0.26 


OR8K3 


11+61.7 


OT 


365 


(DRY) R>L 


0.78 


0.72 


0.75 


0.22 


0.28 


0.25 


OR52N4 


11+15.1 


A>T 


514 


STOP 


0.82 


0.72 


0.77 


0.18 


0.28 


0.23 


OR52H1 


1 1+15.1 


OA 


389 


(DRY) R>H 


0.92 


0.88 


0.9 


0.08 


0.12 


0.1 


OR6Q1 


11+64.1 


ODEL 


685 


DEL 


0.9 


0.94 


0.92 


0.1 


0.06 


0.08 


OR7C2 


19+17 


OA 


365 


(DRY) R>H 


0.87 


0.98 


0.92 


0.13 


0.02 


0.08 


OR51GI 


11+15.1 


G>A 


371 


(DRY) R>H 


0.96 


0.92 


0.94 


0.04 


0.08 


0.06 


OR5AL1p 


11+61.7 


CT>DEL 


421 


DEL 


0.96 


0.91 


0.94 


0.04 


0.09 


0.06 


OR10C1 


6+32.9 


OT 


163 


STOP 


0.93 


0.98 


0.95 


0.07 


0.02 


0.05 


OR2A9p 


7+149 


OT 


277 


STOP 


0.94 


0.96 


0.95 


0.06 


0.04 


0.05 



Table 6 - Oligonucleotide sequences useful in identifying both intact and 



Gene 
Name 


Intact oligo 


Pscudogenc oligo 


Oligonucleotide for primer extension 
assay 


ORI0C1 (79) 


GATGCTGCCCTCCAGTCCCCTATGT (1) 


GATGCTGCCCTCTAGTCCCCTATGT (27) 


TCCACTGATGCTGCCCTC (53) 


ORIOXlp (80) 


GGGTCTAACTTGGGTGGACAGGTCC (2) 


GGGTCTAACTTGAGTGGACAGGTCC (28) 


CCATCATGGGTCTAACTTG (54) 


ORUH7p(S\) 


AGAACTGTGCTTCAGGTTCCTTCTT (3) 


AGAACTGTGCTT rAGGTTCCTTCTT (29) 


CAGCTGAAGAAGGAACCT (55) 


OR12D2 (82) 


GGCATTTGACCGCTCTGTGGCTAT (4) 


GGCATTTGACCTCTCTGTGGCTAT (30) 


GCCGTGATGGCATTTGACC (56) 


OR1B1 (83) 


CGGCCACTTCTGCG AGCCTCTTGTT (5) 


CGGCC A CITCTGTG AGCCTCTTGTT (3 1 ) 


TGTCAGAACAAGAGGCTC (57) 


ORJE3p (84) 


CCTGGGCCTTCCCATCCAACCTGAG (6) 


CCTGGGCCTTCC ATCC A ACCTG AG (32) 


CCTGCTCCTGGGCCTTCC (58) 


ORlPlp (85) 


TGCTCTAAGATCCTTCCTGGCTTCTTC (7) 


TGCTCTTAGATCCTTCCTGGCTTCTTC (33 


AGAAGCCAGGAAGGATCT (59) 


ORIS! (86) 


TGGCCTATGACCGCTTTGTGGCGAT (8) 


TGGCCTATGACCACTTTGTGGCGAT (34) 


TGGCAGATCGCCACAAAG (60) 


OR2A9p(Sl) 


CTGCATGACACAGACCTTTCTCTTT (9) 


CTGCATGACATAGACCTTTCTCTTT (35) 


ACTCAAAAAGAGAAAGGTCT (61) 
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ORZ/Ip/W) 


TGTGTTGATACCCAGGCAAATGAGC (10) 


TGTGTTGATACCTAGGCAAATGAGC (36) 


GATTATCATGTGTTGATACC (62) 


OR3AJ (89) 


TGGCCTATGACCGATTCCTGGCCAT (11) 


TGGCCTATGACCAATTCCTGGCCAT (37) 


CGGCAGATGGCCAGGAAT (63) 


OR4A8p (90) 


ACCATCATGAATCGACAGGTTTGCA (12) 


ACCATCATGAATrGACAGGTTTGCA (38) 


GAAGGATGCAAACCTGTC (64) 


OR4K3 (91) 


GGCATGATTGCCCTGGTGTGCTTCA (13) 


GGCATGATTGCCTGGTGTGCTTCA (39) 


AGAGGATGAAGCACACCA (65) 


OR4Q2p (92) 


AAAACTTTTTTTGTGATGTTCCCC (14) 


AAAACTTTTTTTTGTGATGTTCCCC (40) 


ATCAGGTGGAAAAC 1 TTTTTT (66) 


OR5IFlp (93) 


TGGTTTGAGGCACGTGAAATCAGTCT (15) 


TGGTTTGAGGCAGTGAAATCAGTCT (41) 


ATCCTCTGGTTTGAGGCA (67) 


O/W/G/ (94) 


TGTCCATTGACCGCTACGTGGCCGT (16) 


TGTCCATTGACCACTACGTGGCCGT (42) 


TTATCCATGTCCATTGACC (68) 


ORS2B4 (95) 


CCTATGTCACCGCCCTTCTTGGGAA (17) 


CCTATGTCACCGCCTTCTTGGGAA (43) 


AGGCTGTTCCCAAGAAGG (69) 


0/?52/// (96) 


TGGCATTTGATCGCTATGTAGCTAT (18) 


TGGCATTTGATCACTATGTAGCTAT (44) 


TGGCCATGGCATTTGATC (70) 


OR52N4 (97) 


CTGCCCTACTGCAGAGGCAATATAC (19) 


CTGCCCTACTGCTGAGGCAATATAC (45) 


TGGGGAAGTATATTGCCTC (71) 


OR5AL!p(9S) 


ATTCCTAGAAAACTCTGTATTAAA (20) 


ATTCCTAGAAAACTGTATTAAA (46) 


GTCATTCTCATTCCTAGAAAA (72) 


OR5G3 (99) 


GAGTGTTTTCTCCTGGCTGCCATGG (21) 


GAGTGTTTTCTCTGGCTGCCATGG (47) 


GCAGCTGAGTGTTTTCTC (73) 


O/W//tf(100) 


CAACACTTTTACTGTGACATTATCC (22) 


CAACACTTTTACCGTGACATTATCC (48) 


TAACAATGGGATAATGTCAC (74) 


O/MC/ (101) 


ATCGTCTGGACACTGCTGCACATCC (23) 


ATCGTCTGGACATGCTGCACATCC (49) 


GGCAACATCGTCTGGACA (75) 


OA7C2 (102) 


CGGCCTATGACCGCTTCGTGGCCAT (24) 


CGGCCTATGACCACTTCGTGGCCAT (50) 


TAACAGATGGCCACGAAG (76) 


OK&/2p(103) 


TACTTTTTCCTCCGACACTTGGCTA (25) 


TACT I T T 1 CCTC I'G AC ACTTGGCTA (51) 


CCCCATGTACTTTTTCCTC (77) 


OR8K3 (104) 


TGTCCTACG ACCGCTATGTGGCCAT (26) 


TGTCCTACGACCTCTATGTGGCCAT (52) 


GTTACAGATGGCCACATAG (78) 



SEQ ID NO of ORs and oligonucleotide sequences indicated within parentheses 



It is appreciated that certain features of the invention, which are, for clarity, 
described in the context of separate embodiments, may also be provided in 
5 combination in a single embodiment. Conversely, various features of the invention, 
which are, for brevity, described in the context of a single embodiment, may also be 
provided separately or in any suitable subcombination. 

Although the invention has been described in conjunction with specific 
10 embodiments thereof, it is evident that many alternatives, modifications and variations 
will be apparent to those skilled in the art. Accordingly, it is intended to embrace all 
such alternatives, modifications and variations that fall within the spirit and broad 
scope of the appended claims. All publications, patents and patent applications 
mentioned in this specification are herein incorporated in their entirety by reference 
15 into the specification, to the same extent as if each individual publication, patent or 
patent application was specifically and individually indicated to be incorporated herein 
by reference. In addition, citation or identification of any reference in this application 
shall not be construed as an admission that such reference is available as prior art to 
the present invention. 
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WHAT IS CLAIMED IS: 

1. An oligonucleotide comprising a nucleic acid sequence selected 
suitable for identifying an olfactory receptor gene or an allelic variant thereof, said 
olfactory receptor gene being selected from the group consisting of SEQ ID NOs: 79- 
104. 

2. The oligonucleotide of claim 1, further comprising a detectable moiety 
attached to said nucleic acid sequence. 

3. The oligonucleotide of claim 2, wherein said detectable moiety is 
selected from the group consisting of a dye, a fluorophore, an enzyme, a ligand and a 
radioisotope. 

4. The oligonucleotide of claim 1, wherein the oligonucleotide is selected 
from the group consisting of SEQ ID NOs: 1-78. 

5. The oligonucleotide of claim 1, wherein the oligonucleotide is an SNP- 
specific oligonucleotide. 

6. The oligonucleotide of claim 1, wherein the oligonucleotide is a primer 
extension oligonucleotide. 

7. A kit for identifying an olfactory receptor gene and/or an allelic variant 
thereof, the kit comprising at least oligonucleotide having a nucleic acid sequence 
selected suitable for identifying the olfactory receptor gene and/or the allelic variant 
thereof. 

8. The kit of claim 7, further comprising reagents suitable for detecting 
identification of the olfactory receptor gene and/or the allelic variant thereof by said at 
least one oligonucleotide. 
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9. The kit of claim 7, further comprising packaging material identifying 
said at least oligonucleotide as being utilizable in detecting the olfactory receptor gene 
and/or the allelic variant thereof. 

10. The kit of claim 7, wherein the olfactory receptor gene is selected from 
the group consisting of SEQ ID NOs: 79-104. 

11. The kit of claim 7, wherein said at least one oligonucleotide is selected 
from the group consisting of SEQ ID NOs: 1-78. 

12. The kit of claim 7, wherein said at least oligonucleotide includes a 
detectable moiety attached to said nucleic acid sequence. 

13. The kit of claim 12, wherein said detectable moiety is selected from the 
group consisting of a dye, a fluorophore, an enzyme, a ligand and a radioisotope. 

1 4. An array for d etecting t he p resence o r absence o f a 1 1 east o ne allelic 
variant of an olfactory receptor gene in a subject, the array comprising at least one 
oligonucleotide being contained in or attached to a support, said at least 
oligonucleotide having a nucleic acid sequence selected suitable for specifically 
identifying the at least one allelic variant of the olfactory receptor gene. 

15. The array of claim 14, further comprising at least one additional 
oligonucleotide having a nucleic acid sequence selected suitable for specifically 
identifying the olfactory receptor gene. 

16. The array of claim 15, wherein the olfactory receptor gene is selected 
from the group consisting of SEQ ID NOs: 79-104. 

17. The array of claim 14, wherein said at least one oligonucleotide is 
selected from the group consisting of SEQ ID NOs: 27-78. 
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18. An array for typing a subject according to presence or absence of allelic 
variants of olfactory receptor genes, the array comprising a plurality of 
oligonucleotides e ach being attached to a support, said plurality of oligonucleotides 
include at least one typing oligonucleotide having a sequence selected suitable for 
specifically identifying presence or absence of a specific allelic variant of a specific 
olfactory receptor gene in the subject. 

19. The array of claim 18, wherein said plurality of oligonucleotides also 
include at least one reference oligonucleotide having a sequence selected suitable for 
specifically identifying said specific olfactory receptor gene. 

20. The array of claim 1 8, wherein said support is a chip. 

21. The array of claim 18, wherein said at least one typing oligonucleotide 
is selected from the group consisting of SEQ ID NOs: 27-78. 

22. The array of claim 19, wherein said at least one reference 
oligonucleotide is selected from the group consisting of SEQ ID NOs: 1-26. 

23. A method of typing a subject according to presence or absence of 
allelic variants of olfactory receptor genes, the method comprising detecting the 
presence or absence of at least one allelic variant of an olfactory receptor gene in a 
biological sample of the subject thereby typing the subject. 

24. The method of claim 23, further comprising detecting the presence or 
absence of said olfactory receptor gene in said biological sample of the subject. 

25. The method of claim 23, wherein the olfactory receptor gene is selected 
from the group consisting of SEQ ID NOs: 79-104. 

26. The method of claim 23, wherein said detecting the presence or absence 
of at least one allelic variant of said olfactory receptor gene is effected using at least 
one oligonucleotide selected from the group consisting of SEQ ID NOs: 27-78. 
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27. The method of claim 24, wherein said detecting the presence or absence 
of said olfactory receptor gene is effected using at least one oligonucleotide selected 
from the group consisting of SEQ ID NOs: 1-26. 

28. The method of claim 23, wherein said detecting the presence or absence 
of at least one allelic variant of said olfactory receptor gene is effected by detecting 
DNA and/or mRNA sequences. 

29. The method of claim 23, wherein said detecting the presence or absence 
of at least one allelic variant is effected using at least one o ligonucleotide selected 
from the group consisting of SEQ ID NOs: 27-78. 

30. The method of claim 24, wherein said detecting the presence or absence 
of said olfactory receptor gene is effected using at least one oligonucleotide selected 
from the group consisting of SEQ ID NOs: 1-26. 

31. A nucleic acid construct comprising a polynucleotide encoding an 
olfactory receptor gene and/or an allelic variant thereof, said olfactory receptor gene 
being selected from the group consisting of SEQ ID Nos: 79-104 and a promoter for 
directing transcription of said olfactory receptor gene or said allelic variant thereof in a 
cell. 

32. A cell comprising the nucleic acid construct of claim 3 1 . 



33. The cell of claim 31, wherein the cell is a mammalian cell. 




Fig. 2 
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SEQUENCE LISTING 



<110> 



Lancet, Doron 
Menashe, Idan 
Man , Orna 
Gilad, Yoav 



<120> POLYMORPHIC OLFACTORY RECEPTOR GENES AND ARRAYS, KITS AND METHODS 
UTILIZING INFORMATION DERIVED THEREFROM FOR GENETIC TYPING OF 
INDIVIDUALS 

<130> 26015 

<160> 104 

<170> Patentln version 3.2 

<210> 1 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 1 

gatgctgccc tccagtcccc tatgt 25 



<210> 2 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 3 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 3 

agaactgtgc ttcaggttcc ttctt 25 



<210> 4 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 5 
<211> . 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<400> 2 

gggtctaact tgggtggaca ggtcc 



25 



<400> 4 

ggcatttgac cgctctgtgg ctat 



24 



<400> 5 
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2 



cggccacttc tgcgagcctc ttgtt 



25 



<210> 6 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 6 

cctgggcctt cccatccaac ctgag 25 



<210> 7 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 8 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 8 

tggcctatga ccgctttgtg gcgat 25 



<210> 9 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic dna oligonucleotide 



<210> 10 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 10 

tgtgttgata cccaggcaaa tgagc 25 

<210> 11 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<400> 7 

tgctctaaga tccttcctgg cttcttc 



27 



<400> 9 

ctgcatgaca cagacctttc tcttt 



25 



<400> 11 

tggcctatga ccgattcctg gccat 



25 
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3 



<210> 12 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 12 

accatcatga atcgacaggt ttgca 25 



<210> 13 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 14 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 14 

aaaacttttt ttgtgatgtt cccc 24 



<210> 15 

<211> 26 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Synthtic DNA oligonucleotide 



<210> 16 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 16 

tgtccattga ccgctacgtg gccgt 2S 



<210> 17 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<400> 13 

ggcatgattg ccctggtgtg cttca 



25 



<400> 15 

tggtttgagg cacgtgaaat cagtct 



26 



<400> 17 

cctatgtcac cgcccttctt gggaa 



25 



<210> 18 
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4 



<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 18 

tggcatttga tcgctatgta gctat 25 



<210> 19 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 20 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 20 

attcctagaa aactctgtat taaa 24 



<210> 21 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 22 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 22 

caacactttt actgtgacat tatcc 25 



<210> 23 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<400> 19 

ctgccctact gcagaggcaa tatac 



25 



<400> 21 

gagtgttttc tcctggctgc catgg 



25 



<400> 23 

atcgtctgga cactgctgca catcc 



25 



<210> 24 
<211> 25 
<212> DNA 
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<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 24 

cggcctatga ccgcttcgtg gccat 25 

<210> 25 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 25 

tactttttcc tccgacactt ggcta 25 

<210> 26 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 26 

tgtcctacga ccgctatgtg gccat 25 

<210> 27 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 27 

gatgctgccc tctagtcccc tatgt 25 

<210> 28 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 28 

gggtctaact tgagtggaca ggtcc 25 

<210> 29 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 
<400> 29 

agaactgtgc tttaggttcc ttctt 25 

<210> 30 
<2ll> 24 
<212> DNA 

<213> Artificial sequence 
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6 



<220> 

<223> Synthtic DNA oligonucleotide 



<400> 30 

ggcatttgac ctctctgtgg ctat 



24 



<210> 31 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 31 

cggccacttc tgtgagcctc ttgtt 25 



<210> 32 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 33 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 33 

tgctcttaga tccttcctgg cttcttc 27 



<210> 34 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 35 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 35 

ctgcatgaca tagacctttc tcttt 25 



<210> 36 

<211> 25 

<212> DNA 

<213> Artificial sequence 



<400> 32 

cctgggcctt ccatccaacc tgag 



24 



<400> 34 

tggcctatga ccactttgtg gcgat 



25 



<220> 

<223> Synthtic DNA oligonucleotide 
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7 



<400> 36 

tgtgttgata cctaggcaaa tgagc 



25 



<210> 37 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 37 

tggcctatga ccaattcctg gccat 25 



<210> 38 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 39 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 39 

ggcatgattg cctggtgtgc ttca 24 



<210> 40 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 41 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 41 

tggtttgagg cagtgaaatc agtct 25 



<210> 42 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<400> 38 

accatcatga attgacaggt ttgca 



25 



<400> 40 

aaaacttttt tttgtgatgt tcccc 



25 



<400> 42 
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8 



tgtccattga ccactacgtg gccgt 



25 



<210> 43 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 43 

cctatgtcac cgccttcttg ggaa 24 



<210> 44 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 45 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 45 

ctgccctact gctgaggcaa tatac 25 



<210> 46 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 47 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 47 

gagtgttttc tctggctgcc atgg 24 



<210> 48 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<400> 44 

tggcatttga tcactatgta gctat 



25 



<400> 46 

attcctagaa aactgtatta aa 



22 



<400> 48 

caacactttt accgtgacat tatcc 



25 
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9 



<2X0> 49 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 49 

atcgtctgga catgctgcac atcc 24 



<210> 50 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 51 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 51 

tactttttcc tctgacactt ggcta 25 



<210> 52 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 53 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 53 

tccactgatg ctgccctc 18 



<210> 54 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<400> 50 

cggcctatga ccacttcgtg gccat 



25 



<400> 52 

tgtcctacga cctctatgtg gccat 



25 



<400> 54 

ccatcatggg tctaacttg 



19 



<210> 55 
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10 



<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 55 

cagctgaaga aggaacct 18 



<210> 56 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 57 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 57 

tgtcagaaca agaggctc 18 



<210> 58 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 59 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 59 

agaagccagg aaggatct 18 



<210> 60 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<400> 56 

gccgtgatgg catttgacc 



19 



<400> 58 

cctgctcctg ggccttcc 



18 



<400> 60 

tggcagatcg ccacaaag 



18 



<210> 61 
<211> 20 
<212> DNA 
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<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 
<400> 61 

actcaaaaag agaaaggtct 



<210> 62 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 62 

gattatcatg tgttgatacc 



<210> 63 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 63 

cggcagatgg ccaggaat 



<210> 64 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 64 

gaaggatgca aacctgtc 



<210> 65 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 65 

agaggatgaa gcacacca 

<210> 66 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 66 

atcaggtgga aaactttttt t 



<210> 67 

<211> 18 

<212> DNA 

<213> Artificial sequence 



11 



20 



20 



18 



18 



18 
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12 



<220> 

<223> Synthtic DMA oligonucleotide 



<400> 67 

atcctctggt ttgaggca 



18 



<210> 68 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 68 

ttatccatgt ccattgacc 19 



<210> 69 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 70 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 70 

tggccatggc atttgatc 18 



<210> 71 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 



<210> 72 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 72 

gtcattctca ttcctagaaa a 21 



<210> 73 

<211> 18 

<212> DNA 

<213> Artificial sequence 



<400> 69 

aggctgttcc caagaagg 



18 



<400> 71 

tggggaagta tattgcctc 



19 



<220> 

<223> Synthtic DNA oligonucleotide 
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13 



<400> 73 

gcagctgagt gttttctc 



<210> 74 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 74 

taacaatggg ataatgtcac 

<210> 75 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 75 

ggcaacatcg tctggaca 



<210> 76 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 76 

taacagatgg ccacgaag 



<210> 77 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 77 

ccccatgtac tttttcctc 



<210> 78 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthtic DNA oligonucleotide 

<400> 78 

gttacagatg gccacatag 



<210> 79 

<211> 936 

<212> DNA 

<213> Homo sapiens 

<400> 79 

atgagtgcaa acacctccat ggtgactgag tttcttcttc 



PCT/IL03/00336 



18 



20 



18 



18 



19 



19 



tcggcttctc ccacctggcc 60 



gacctccagg gcttgctctt ctctgtcttt ctcactatct acctgctgac cgtggcaggc 120 
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aatttcctca 


ttgtggtgct 


ggtctccact gatgctgccc tccagtcccc 


tatgtacttc 


180 


ttcctgcgca 


ccctctcggc 


cttggagatt ggctatacgt ctgtcacggt 


ccccctgcta 


240 


cttcaccacc 


tccttactgg ccggcgccac atctctcgct ctggatgtgc tctccagatg 


300 


ttcttcttcc 


tcttctttgg cgccacggag tgctgcctcc tggcagccat 


ggcctatgac 


360 


cgctatgcag 


ccatctgtga 


acccctccgc tacccactgc tgctgagcca 


ccgggtgtgt 


420 


ctacagctag 


ctgggtcggc gtgggcctgt ggggtgctgg tggggctggg 


ccacacccct 


480 


ttcatcttct 


ctttgccctt 


ctgcggcccc aataccatcc cgcagttctt 


ctgtgagatc 


540 


cagcctgtcc 


tgcagctggt 


atgtggagac acctcgctta atgaactgca 


gattatcctg 


600 


gcaacagccc 


tcctcatcct 


ctgccccttt ggcctcatcc tgggctccta 


cgggcgtatc 


660 


ctcgttacca 


tcttccggat 


cccatctgtt gcgggccgcc gcaaggcctt 


ctccacctgc 


720 


tcctcccacc 


tgatcgtggt 


ctccctcttc tatggcaccg cactctttat 


ctatattcgc 


780 


cctaaggcca 


gctacgatcc 


ggccactgac cctctggtgt ccctcttcta 


tgctgtggtc 


840 


acccccatcc 


tcaaccccat 


catctacagc ctgcggaaca cagaggtcaa 


agctgcccta 


900 


aagagaacca 


tccagaaaac 


ggtgcctatg gagatt 




936 



<210> 80 

<211> 974 

<212> DNA 

<213> Homo sapiens 

<400> 80 

atgaagatca accagacaat cctgaaggaa ttcattcttg ttggcttttc tgtgtaccca 60 

catgtacaga catttctttt tgtggtcttc ttttgtctct accttctcac ccttgcaggt 120 

aatctgacca tcatgggtct aacttgagtg gacaggtccc tccacacccc tatgtatctc 180 

ttccttagtg cactctcctt ctctgagacc tgctatacac tgaccatcgt ccccaagatg 240 

ctggaagatc tactggccaa ggacagaagc atttcagtca caggttgtag cttacagatg 300 

tgcttcttct tgggacttgg tggcacaaac tgtatcattc tcactttgat gggatatgac 360 

cgcttcctgg ccatttgtaa ccctctaaga tatccactgc ttatgaccaa cattgtatgt 420 

ggacaacttg tggcctctgc ttgcactgca ggcttcttta tctctcttac agagactgca 480 

ctgatattca gggactottt ctgcagaccc aaccttgtca aacacttctt ctgccatatg 540 

ctggcagtta ttaggctgtc ttgtatagac agtaaccaca cagaattcat tataacactg 600 

atctcagtgt ctggtttgct gggtaccctt ctgctcatca tcctgactga tgtcttcatt 660 

atttctactg tcctcaggat cccttcagct gagggcaagc agaaggcctt caccacctgt 720 

gcctcccacc tcaccgtggt tataatccac tttggttttg catctattgt ttatttgaag 780 

ccagaagcct caggagatga cacactcata gcagtccctt atactgtcat tacccccttc 84 0 

ctcagcccca tcatattcag cctgaggaat aaggacatga aaaatgcttt tagaagaatg 900 

atgggaaaca cagttgcctt gaaaaaataa tcttgggttg ttgctgcttg tttgaagaag 960 

ggctcaatgt cccc 9 74 



<210> 81 
<211> 954 
<212> DNA 
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<213> Homo sapiens 












<400> 81 
agtctgggaa 


gcatgaataa 


ctcacagata 


tctactgtga 


cgcagtttgt 


gttgttgggg 


60 


tttcctggtc 


cctggaaaat 


tcagatcatc 


tttttctcaa 


tgattttgtt 


ggtctacatc 


120 


ttcactctga 


ctgggaatat ggccatcatc tgtgcagtga ggtgggacca tcgactccat 


180 


acccctatgt 


acgtgctcct 


agccaacttc 


tccttcctag 


agatctggta 


tgtgacctgc 


240 


acagtcccca 


acatgctggt 


aaattttttc 


tccaaaacta 


agaccatatc 


attctctgga 


300 


tgtttcactc 


agttccactt 


cttcttttcc 


ctgggcacaa 


ctgaatgctt 


cttcctctgt 


360 


gtcatggctt 


atgatcggta 


cctggccatc 


tgccacccac 


tgcactatcc 


ctccattatg 


420 


actggccagc 


tctgtggcat 


cttggtgtct 


ctttgttggc 


tcattggttt 


ccttggacat 


480 


tcaatttcca 


ttttcttcat 


ttttcaacta 


cctttctgtg gtcccaacat 


cattgatcat 


540 


tttctgtgtg 


atgtagaccc 


actgatggca 


ttgtcctctg cccctactca catcataggg 


600 


catgtgttcc 


attctgtgag 


ctctcttttc 


atcaacctca 


ccatggtgta 


catccttggg 


660 


tcctatacct 


tggtgctcag 


aactgtgctt 


taggttcctt 


cttcagctgg 


atggcaaaag 


720 


gccatctcta 


cctgtgggtc 


acacttggtt 


gttgtgtctc 


tgttctatgg 


agccataatg 


780 


ctgatgtatg 


tgagtcccac 


acctggcaac 


tcagttgcta 


tgcataagct 


catcacactg 


840 


atatattctg 


tggtaacacc 


tgtcttaaac 


cccctcatct 


acagcctacg 


caacaaggac 


900 


atgaaatatg 


ccctccatca 


tgtcttctgt 


ggaatgagaa 


ttatccagag 


atca 


954 



<210> 82 

<211> 921 

<212> DNA 

<213> Homo sapiens 










<400> 82 
atgctgaata 


caacctcagt 


caccgaattt 


ctcctcttgg gagtgacaga 


cattcaagaa 


60 


ctgcagcctt 


ttctcttcgt 


ggttttcctc 


accatctact tcatcagtgt 


gactgggaat 


120 


ggagccgttc 


tgatgattgt 


catctccgat 


cctagactcc attcccttat 


gtatttcttc 


180 


ctgggaaacc 


tgtcctacct 


ggatatctgt 


tactctacgg tgacactgcc 


aaaaatgctg 


240 


cagaactttc 


tctctacaca 


caaagcaatt 


tctttcttgg gatgcataag 


ccagcttcat 


300 


ttcttccact 


tcctgggcag 


cacggagtcc 


atgttgttcg ccgtgatggc 


atttgacctc 


360 


tctgtggcta 


tctgcaagcc 


acttcgctac 


actgtcatca tgaaccctca 


gctctgtacc 


420 


cagatggcca 


tcacaatctg ggtcattggt 


tttttccatg ccctgctgca 


ctccgtaatg 


480 


acttctcgct 


tgaacttctg tggttccaac cgtatccatc attttctctg tgatattaag 


540 


ccattgctaa 


agctggcctg 


tgggaacact 


gagcttaatc agtggctact 


cagtactgtc 


600 


acggggacaa 


ttgccatggg 


ccccttcttt 


ctgacacttc tctcctattt 


ctacattatc 


660 


acttatctct 


tcttcaagac 


ccgttcttgt 


agcatgctct gtaaagcact gtccacttgt 


720 


gcctcccact 


tcatggtagt 


tattcttttc 


tatgcacctg ttcttttcac 


ctatatccat 


780 


cctgcgttag 


agagcttcat ggaccaggac cggattgttg ccatcatgta cactgtggtc 


840 


actcctgtac 


taaacccact 


gatctatact 


ttgaggaaca aggaagtgaa 


gggggccttg 


900 


ggtagagtga 


tcagaaggct 


t 






921 
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<210> 83 

<211> 9S4 

<212> DNA 

<213> Homo sapiens 

<400> 83 



atgatgagct 


ttgcccctaa 


tgcttcacac tctccggttt 


ttttgctcct 


tgggttctcg 


60 


agagctaaca 


tctcctacac 


tctcctcttc ttcctgttcc 


tggctattta 


cctgaccacc 


120 


atactgggga 


atgtgacact 


ggtgctgctc atctcctggg 


actccagact 


gcactcaccc 


160 


atgtattatc 


tgcttcgtgg 


cctctctgtg atagacatgg 


ggctatccac 


agttacactg 


240 


ccccagttgc 


tggcccattt 


ggtctctcat tacccaacca 


ttcctgctgc 


ccgctgcttg 


300 


gctcagttct 


ttttcttcta 


tgcatttggg gttacagata 


cacttgtcat 


tgctgtcatg 


360 


gctctggatc 


gctatgtggc 


catctgtgac cccctgcact 


atgctttggt 


aatgaatcac 


420 


caacggtgtg 


cctgcttact 


agccttgagc tgggtggtgt 


ccatactgca 


caccatgttg 


480 


cgtgtgggac 


tcgtcctgcc 


tctttgctgg actggggatg 


ctgggggcaa cgttaacctt 


540 


cctcacttct 


tttgtgacca 


ccggccactt ctgcgagcct 


cttgttctga 


catacattct 


600 


aatgagctgg 


ccatattctt 


tgagggtggc ttccttatgc 


tgggcccctg 


tgccctcatt 


660 


gtactctctt 


atgtccgaat 


tggggccgct attctacgtt 


tgccttcagc 


tgctggtcgc 


720 


cgccgagcag 


tctccacctg 


tggatcccac ctcaccatgg 


ttggtttcct 


ctacggcacc 


780 


atcatttgtg 


tctacttcca 


gcctcccttc cagaactctc 


agtatcagga 


catggtggct 


840 


tcagtaatgt 


atactgccat 


tacacctttg gccaacccat 


ttgtgtatag 


cctccacaat 


900 


aaggatgtca 


agggtgcact 


ctgcaqgctq cttqaatqqq 


tgaaggtaga 


cccc 


954 


<210> 84 

<211> 941 

<212> DNA 

<213> Homo sapiens 










<400> 84 
atgatgaaga 


agaaccaaac 


catgatctca gagttcctgc 


tcctgggcct 


tccatccaac 


60 


ctgagcagcg 


gaatctgttc 


tatgccttgt tcttggccgt 


gtatcttacc 


accctcctgg 


120 


ggaacctcct 


cgtcattgtc 


ctcattcgac tggactccca 


cctccacatg 


cctatgtatt 


180 


tgtgtctcag 


caacttgtcc 


ttctctgacc tctgcttttc 


ctcggtcaca 


atgcccaaat 


240 


tgctgcagaa 


catgcagagc 


caaaacccat ccatcccctt 


tgcggactgc 


ctggctcaga 


300 


tgtactttca 


tctgttttat 


ggagttctgg agagcttcct 


ccttgtggtc 


atggcttatc 


360 


actgctatgt 


ggctatttgc 


tttcctctgc actacaccac 


tatcatgagc 


cccaagtgtt 


420 


gccttggtct 


gctgacactc 


tcctggctgt tgaccactgc 


ccatgccacg 


ttgcacacct 


480 


tgcttatggc 


caggctgtcc 


ttttgtgctg agaatgtgat 


tcctcacttt 


ttctgtgata 


540 


catctacctt 


gttgaagctg 


gcctgctcca acacgcaagt 


caatgggtgg gtgatgtttt 


600 


tcatgggcgg 


gctcatcctt 


gtcatcccat tcctactcct 


catcatgtcc 


tgtgcaagaa 


660 


tcgtctccac 


catcctcagg 


gtcccttcca ctgggggcat 


ccagaaggct 


ttctccacct 


720 


gtggccccca 


cctctctgtg 


gtgtctctct tctatgggac 


aattattggt 


ctctacttgt 


780 


gcccattgac 


gaatcataac 


actgtgaagg acactgtcat 


ggctgtgatg 


tacactgggg 


840 
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tgacccacat gctgaacccc ttcatctaca gcctgaggaa cagagacatg agggggaacc 900 

ctgggcagag tcttcagcac aaagaaaatt tttttgtctt t 941 

<210> 85 

<211> 948 

<212> DNA 

<213> Homo sapiens 

<400> 85 

ctcctggaag gagggaatca gactagcacc tttgagttcc tcctctgggg actctcagac 60 

cagccacagc agcaacacat cttcttcctg ctgtttctgt ggatgtacgt ggtcactgtg 120 

gctgggaacc tgctcattgt cctggccatt ggcactgaca cacacctcca cacccctatg 180 

tacttcttcc ttgccagctt gtcgtgtgca gatatctttt ccacctccac cactgtgccc 24 0 

aaggccctgg tgaatatcca gacccagagc aggtccattt cctacgcagg gtgtttggca 300 

cagctctact tcttcttgac ttttggggac atggacatct ttctcccggc tacaatggcc 3 60 

tatgaccgct atgtggccat ttgccacctg ctccactata tgatgatcat gagcctccac 4 20 

cgctgtgcct tcctggtgac agcctgctgg accctcacaa gtcttctcgc catgactcgc 4 80 

accttcctca tattccggct ttccttgtgc tcttagatcc ttcctggctt cttctgtgat 540 

ttgggaccgc tgatgaaggt gtcttgctct gacgcccagg tcaatgagct tgtgctcctc 600 

ttcctagggg gagcagtcat tttaatccct tttatgctca tcctggtctc ttatatccgc 660 

attgtttcag ccatcctcag ggccccctct gcccagggaa ggcgcaaggc cttctctacc 720 

tgcgactctc acctcgttgt tgttgctttg ttctttggga cagtgatcag ggcttatctg 780 

tgcccctcat cctcttcctc caactcagta aaggaggata cagcggctgc tgtcatgtac 84 0 

acagtggtga ctcccctgct gaaccccttt atttacagca tgcggaacaa ggacatgaag 900 

gcggcggtgg ttagacttct caagggcagg gtctccttct cacagggc 948 

<210> 86 

<211> 975 

<212> DNA 

<213> Homo sapiens 

<400> 86 

atgaagactt ttagttcctt tcttcagatc ggcagaaata tgcatcaagg aaaccaaacc 60 

accatcactg aattcattct cctgggattt ttcaagcagg atgagcatca aaacctcctc 120 

tttgtgcttt tcttgggtat gtacctggtc actgtgattg ggaacgggct catcattgtg 180 

gctatcagct tggatacgta ccttcatacc cccatgtatc tcttccttgc caatctatcc 240 

tttgctgata tttcctccat ttccaactca gtccccaaaa tgctggtgaa tattcaaacc 300 

aagagtcaat ccatctctta tgagagctgc atcacacaga tgtacttttc tattgtgttt 360 

gtcgtcattg acaatttgct cttggggacc atggcctatg accactttgt ggcgatctgc 420 

caccctctga attatacaat tctcatgcgg cccaggttcg gcattttgct cacagtcatc 4 80 

tcatggttcc tcagtaatat tattgctctg acacacaccc ttctgctcat tcaattgctc 54 0 

ttctgtaacc acaacactct cccacacttc ttctgtgact tggcccctct gctcaaactg 600 

tcctgttcag atacattgat caatgagctt gtgttgttta ttgtgggttt atcagttatc 660 

atcttcccct ttacactcag cttcttttcc tatgtctgca tcatcagagc tgtcctgaga 720 
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gtatcttcca cacagggaaa gtggaaagcc ttctccactt gtggctctca cctgacagtt 780 

gtattactgt tctacggaac cattgtaggc gtgtactttt tcccctcctc cactcaccct 840 

gaggacactg ataagattgg tgctgtccta ttcactgtgg tgacacccat gataaacccc 900 

ttcatctaca gcttgaggaa taaggatatg aaaggtgccc tgagaaagct catcaataga 960 

aaaatttctt ccctt 975 

<210> 87 

<211> 993 

<212> DNA 

<213> Homo sapiens 

<400> 87 

atggtcacag agttcctcct actgggattt ctcctgggcc caaggattca gatgctcctc 60 

tttgggctct tctccctgtt ctatgtcttc accctgctgg ggaatgggac catcctgggg 120 

ctcatctcac tggactccag actccacacc cccatgtact tcttcctctc acacctggcc 180 

gtcgtcaaca tcgcctatgc ctgcaacaca gtgccccaga tgctggtgaa cctcctgcat 24 0 

ccagccaagc ccatctcctt tgctggctgc atgacataga cctttctctt tttgagtttt 300 

gcacatactg aatgcctcct gttggtgctg atgtcctacg atcggtacgt ggccatctgc 360 

caccctctcc gatatttcat catcatgacc tggaaagtct gcatcactct ggccatcact 420 

tcctggacat gtggctccct cctggctatg gtccatgtga gcctcatcct aagactgccc 4 80 

ttttgtgggc ctcgtgaaat caaccacttc ttctgtgaaa tcctgtctgt cctcaggctg 540 

gcctgtgctg atacctggct caaccaggtg gtcatctttg cagcctgcat gttcatcctg 600 

gtgggaccac tctgcctggt gctggtctcc tactcacaca tcctggcggc catcctgagg 660 

atccagtctg gggagggccg cagaaaggcc ttctccacct gctcctccca cctctgcgta 720 

gtgggactct tctttggcag cgccatcgtc atgtacatgg cccctaagtc ccgccatcct 780 

gaggagcagc agaaggtcct ttttctattt tacagttctt tcaacccgat gctaaacccc 840 

ctgatttaca acctgaggaa tgtagaggtc aagggtgccc tgaggagagc actgtgcaag 900 

gaaagtcatt cctaagaggt gtgacatttg aactgccagc ctcagttgtc acgtggactc 960 

ttgatgccca attattgcct caatccagaa aag 993 

<210> 86 

<211> 1011 

<212> DNA 

<213> Homo sapiens 

<400> 88 

atgaaaaaaa atgcaagttt tgaagacttc tttattctac ttggattttc taactggcct 60 

catctggaag tagttctctt tgtggttatc ttgatcttct acttgataac actgatagga 120 

aacctgttca tcatcatcct gtcatacctg gactcccatc tccacactcc catgtacttc 180 

ttcctttcaa atctctcatt tctggatctc tgctacacca ccagctctat ccctcagttg 240 

ctggtgaatc tctggggccc ggaaaagacc atctcttatg ctggttgtac agttcaactt 3 00 

tactttgttc tcgcactggg aaccgcagag tgtgtcctac tggtggtgat gtcctatgat 360 

cgttatgcag ctgtgtgtag acctttgcat tacactgtcc tcatgcaccc tcgtttctgc 420 

cgcttgttgg ctgcggcttc ttgggtaagt ggttttacaa cctcagcact tcattcctcc 480 
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tttactttct ggatacccct atgtagacat cgcctagtgg atcacttctt ctgtgaagct 540 

ccagcacttc tgcgattatc atgtgttgat acctaggcaa atgagctgac cctcatggtc 600 

atgagctcca tttttgttct catacctctc atcctcatcc tcacttccta tggtgccatt 660 

gcccgggctg tactgagcat gcaatcaacc actgggcttc agaaagtgct taggacatgt 720 

ggagcccatc ttatggttgt atctctcttt ttcattccag tcatgtgcat gtatctccag 780 

ccaccatcag aaaattctca agatcaaggc aagttcattg ccctctttta cactgttgtc 840 

acacctagtc ttaaccctct aatctacact ttcagaaaca aggatgtaag aggggcagtg 900 

aagagactaa tggggtggga atgggggatg tgacagggaa atcatgttgg ctgttgtttt 960 

tcctagggtc ttatccattt tgaaaggttg tttccctgct tctttgtgat t 1011 

<210> 89 

<211> 945 

<212> DNA 

<213> Homo sapiens 

<400> 89 

atgcagccag aatctggggc caatggaaca gtcattgctg agttcatcct gctgggcttg 60 

ctggaggcgc cagggctgca gccagttgtc tttgtgctct tcctctttgc ctacctggtc 120 

acggtcaggg gcaacctcag catcctggca gctgtcttgg tggagcccaa actccacacc 180 

cccatgtact tcttcctggg gaacctatca gtgctggatg ttgggtgcat cagcgtcact 240 

gttccatcaa tgttgagtcg tctcctgtcc cgcaagcgtg cagttccctg tggggcctgc 300 

cttacccagc tcttcttctt ccatctgttc gttggagtgg actgcttcct gctgaccgcc 360 

atggcctatg accaattcct ggccatctgc cggcccctca cctacagcac ccgcatgagt 420 

cagacagtcc agaggatgtt ggtggctgcg tcctgggctt gtgctttcac caacgcactg 4 80 

acccacactg tggccatgtc cacgctcaac ttctgtggcc ccaatgtgat caatcacttc 540 

tactgtgacc tcccacagct cttccagctc tcctgctcca gcacccaact caatgagctg 600 

ctgctttttg ctgtgggttt tataatggca ggtaccccca tggctctcat tgtcatctcc 660 

tatatccacg tggcagctgc agtcctgcga attcgctctg tagagggcag gaagaaagcc 720 

ttctccacat gtggctccca cctcactgtg gttgccatat tctatggttc aggtatcttt 780 

aactatatgc gactgggttc aaccaagctt tcagacaagg ataaagctgt tggaattttc 84 0 

aacactgtca tcaatcccat gctgaaccca atcatctaca gcttcagaaa ccctgatgtg 900 

cagagtgcca tctggaggat gctcacaggg aggcggtcac tggct 94 5 

<210> 90 

<211> 924 

<212> DNA 

<213> Homo sapiens 

<400> 90 

atgaggcaga ataacaatat tacagaattt gtcctcctgg gcttctctca gtatcctgat 60 

gtgcaaaatg cattatttgt catgttttta ctcatatata ttgtgactat ggtggggaac 120 

ctgctcattg tggtgtctat tattgccagt ccctttttgg gctccccagt gtacttcttc 180 

cttgcctgcc tgtcatttat agatgctgtg tattccacca ccatttctcc tgtattgatt 240 

gtagacttac tctgtgataa aaagactatt tccttcccag cttgcatggg tcagctattt 300 



# 



WO 03/091388 W PCT/IL03/00336 

20 

atagagcact tgtttggtga tactgacgtc ttccttctgg tggtgatggc ctatgatcgc 360 

tacgtggcca cctgtaagcc actgcgctat ttgaccatca tgaattgaca ggtttgcatc 420 

cttctgttgg tggtggctgt gactggaggt tttctgcatt ctgtgtttca aattttagtt 480 

gtgtacagtc tccctttctg tggccccaat gtcatttatc actttttctg taacatatac 54 0 

cctttattgg acctggaatg cactgacacc tacttcgtag gcctcgctgt ggttttcaat 600 

ggtggagcaa tctgtatggt catcttcacc cttctactaa tctcctatgg ggtcatccta 660 

aactccctta aaacttatag tccggaaggg aggcataaag ctccgtttat ctgcagctcc 720 

cactttatca tggttatctt gttttttgtt ccctgtattt tcttatatgt tagacccgtt 780 

tcaaactttc ctattgataa attcctgact gtgttttatt cagttatcac acccaagttg 84 0 

aatcctttta tatacatgtt gagaaattca gagatgagaa atgctataga aaatctcttg 900 

ggataccaaa gtgggaagac agga 924 

<210> 91 

<211> 945 

<212> DNA 

<213> Homo sapiens 

<400> 91 

atggcctgga gcaatcagtc tgcggtaacc gaattcatac tacggggtct gtccagttct 60 

ttagaactcc agattttcta cttcctgttt ttctccatag tctatgcagc cactgtgctg 120 

gggaaccttc ttattgtggt caccattgca tcagagccac accttcattc ccctacgtac 180 

tttctgctgg gcaatctctc cttcattgac atgtccctgg cctcatttgc cacccccaaa 24 0 

atgattgcag acttccttag agaacacaaa gccatctctt ttgaaggctg catgacccag 300 

atgttcttcc tacatctctt agggggtgct gagattgtac tgctgatctc catgtccttt 360 

gataggtacg tggctatctg taagcctcta cattacctaa caatcatgag ccgaagaatg 420 

tgtgttgggc ttgtgatact ttcctggatt gtcggcatct tccatgctct gagtcagtta 4 80 

gcatttacag tgaatctgcc cttctgtgga cccaatgaag tagacagttt cttttgtgac 540 

ctccctttgg tgattaaact tgcttgtgtc gacacatata ttctgggggt gttcatgatc 600 

tcaaccagtg gcatgattgc cctggtgtgc ttcatcctct tggtgatctc ttacactatc 660 

atcctggtca ccgttcggca gcgttcctct ggtggatcct ccaaagccct ctccacgtgc 720 

agtgcccact ttactgttgt gacccttttc tttggcccat gcactttcat ttatgtgtgg 780 

cctttcacaa atttcccaat agacaaagta ctctcagtat tttataccat atacactccc 840 

ctcttgaatc cagtgatcta taccgttagg aataaagatg tcaagtattc catgaggaaa 900 

ctaagcagcc atatctttaa atctaggaag actgatcata ctcct 945 

<210> 92 

<211> 910 

<212> DNA 

<213> Homo sapiens 

<400> 92 

atgagagaat ttttcttgtc agggttctca cagacaccat ctattgaagc agggctattt 60 

gtactatttc ttttcttcta tatgtccatt tgggttggca atgtcctcat catggtcaca 120 

gtagcatctg ataaatacct gaattcatca cccatgtatt tccttcttgg caacctctca 180 
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tttctggacc tatgttattc aacagtaacg acccctaagc ttctggctga cttctttaat 24 0 

catgaaaaac tcatttccta tgaccaatgc attgtgcaac tcttcttcct gcattttgta 3 00 

ggggcagctg agatgttcct gctcacagtg atggcgtacg atcgctatgt tgcaatctgt 360 

cgcccgctgc actacaccac tgtcatgagt cgggggttat gctgtgtgtt ggttgctgcc 420 

tcctggatgg gaggatttgt gcactccact gtccagacca ttctcactgt ccatctaccc 480 

ttttgtgggc caaatcaggt ggaaaacttt tttttgtgat gttccccctg tcatcaaact 540 

tgcttgtgct gacacttttg tcattgaatt gctcatggta tctaacagtg ggttgatctc 600 

caccatctcc tttgtggtgc tgatttcctc ctacaccact atcctagtca agattcgctc 660 

caaggaagga aggcgaaagg cactctccac gtgtgcctct cacctcatgg tggtaacact 720 

gttttttgga ccctgtattt tcatctacgc tcgtcctttc tctacatttt ctgtggacaa 780 

gatggtgtct gtactctaca atgttattac cccaatgcta aaccccctca tctacacact 84 0 

tcggaacaaa gaggtaaagt cagccatgca gaagctctgg gtcagaaatg ggcttacttg 900 

gaaaaagcag 9X0 

<210> 93 

<211> 968 

<212> DNA 

<213> Homo sapiens 

<400> 93 

atggaaatcc taagcaactc aacatctaaa tttccaacct tcttgttgac cggcattcct 60 

ggcctagagt ctgcccatgt ctggatctcc attcctttct gctgttttta tgccattgcc 120 

ctctctggga acagcgtgat cctgtttgtc atcattaccc agcagagtct ccatgaaccc 180 

atgtattatt tcctctccat gctatcagcc actgatctgg gcttgactgt ttcttcattg 240 

tcaacaacat taggtatcct ctggtttgag gcagtgaaat cagtctatac agctgcattg 300 

tccagatgtt ttttcttcat ggattcactt ttatggaatc tggagtgctg gtggctacag 360 

cctttgaccg ttatgtggcc atctgtgatc ctctgaggta cactaccatt ctcactaatt 420 

ccagaatcat tcaaatgggt cttctgatga ttacacgtgc tatagtacta atattaccac 480 

tacttttgct ccttaagcct ctctatttct gtagaatgaa tgccctttct cactcctatt 540 

gttaccatcc agatgtgatt caattagcat gttcagacat tcgggcaaat agcatctgtg 600 

gattaattga tctcatcctg accactggaa tagatacacc atgcattgtc ctgtcatata 660 

tcttaattat tcgctttgtc ctcagaattg cctcccctga agaatggcac aaggtcttca 720 

gcacctgtgt ctcccacgtg ggagcagttg ctttcttcta catccacatg ctgagcctgt 780 

ccttggtgta tcgctatggt cggtcagccc ccagagtagt ccattcagtg atggctaacg 840 

tatacctgct tttaccccct gtgctcaacc ccatcatcta cagtgtaaaa acaaaacaaa 900 

tccgcaaggc tatgctcagt ctgctgctta caaaatgaac agacatagtt ttatttgata 960 

caaacctg 968 

<210> 94 

<211> 963 

<212> DNA 

<213> Homo sapiens 

<400> 94 
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atgacaattc ttcttaatag cagcctccaa agagccactt tcttcctgac gggcttccaa 60 

ggtctagaag gtctccatgg ctggatctct attcccttct gcttcatcta cctgacagtt 120 

atcttgggga acctcaccat tctccacgtc atttgtactg atgccactct ccatggaccc 180 

atgtactatt tcttgggcat gctagctgtc acagacttag gcctttgcct ttccacactg 240 

cccactgtgc tgggcatttt ctggtttgat accagagaga ttggcatccc tgcctgtttc 300 

actcagctct tcttcatcca caccttgtct tcaatggagt catcagttct gttatccatg 360 

tccattgacc gctacgtggc cgtctgcaac ccactgcatg actccaccgt cctgacacct 420 

gcatgtattg tcaagatggg gctaagctca gtgcttagaa gtgctctcct catcctcccc 4 80 

ttgccattcc tcctgaagcg cttccaatac tgccactccc atgtgctggc tcatgcttat 540 

tgtcttcacc tggagatcat gaagctggcc tgctctagca tcattgtcaa tcacatctat 600 

gggctctttg ttgtggcctg caccgtgggt gtggactccc tgctcatctt tctctcatac 660 

gccctcatcc ttcgcaccgt gctcagcatt gcctcccacc aggagcgact ccgagccctc 720 

aacacctgtg tctctcatat ctgtgctgta ctgctcttct acatccccat gattggcttg 780 

tctcttgtgc atcgctttgg tgaacatctg ccccgcgttg tacacctctt catgtcctat 840 

gtgtatctgc tggtaccacc ccttatgaac cccatcatct acagcatcaa gaccaagcaa 900 

attcgccagc gcatcattaa gaagtttcag tttataaagt cacttaggtg tttttggaag 960 

gat 9 63 

<210> 95 

<211> 953 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (836) . . (836) 
<223> Any nucleotide 

<400> 95 

atgcctactg taaaccacag tggcactagc cacacagtct tccacttgct gggcatccct 60 

ggcctacagg accagcacat gtggatttct atcccattct tcatttccta tgtcaccgcc 120 

cttcttggga acagcctgct catcttcatt atcctcacaa agcgcagcct ccatgaaccc 180 

atgtacctct tcctctgcat gctggctgga gcagacattg tcctctccac gtgcaccatt 240 

cctcaggcct tagctatctt ctggttccgt gctggggaca tctccctgga tcgttgcatc 300 

actcagctct tcttcatcca ttccaccttc atctctgagt cagggatctt gctggtgatg 360 

gcctttgacc actatattgc catatgctac ccactgaggt acaccaccat tcttacaaat 420 

gctctgatca agaaaatttg tgtgactgtc tctctgagaa gttatggtac aattttccct 4 80 

atcatatttc ttttaaaaag attgactttc tgccagaata atattattcc acacaccttt 540 

tgtgaacaca ttggcctagc caaatatgca tgtaatgaca ttcgaataaa catttggtat 600 

gggttttcca ttctaatgtc gacggtggtc ttagatgttg tactaatttt tatttcctat 660 
atgctgattc tccatgctgt cttccacatg ccttctccag atgcttgcca caaagctctc 720 
aacacatttg gctcccatgt ctgcatcatc atcctctttt atgggtctgg catcttcaca 780 
atccttaccc agaggtttgg acgccacatt ccaccttgta tccacatccc gttggntaat 840 
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gtctgcattc tggctccacc tatgctgaat cccattattt atgggatcaa aaccaagcaa 900 

atccaggaac aggtggttca gtttttgttt ataaaacaga aataactttg gtt 9 53 

<210> 96 

<211> 960 

<212> DNA 

<213> Homo sapiens 

<400> 96 

atgccatctg cctctgccat gatcattttc aacctgagca gttacaatcc aggacccttc 60 

attctggtag ggatcccagg cctggagcaa ttccatgtgt ggattggaat tcccttctgt 120 

atcatctaca ttgtagctat tgtgggaaac tgcatccttc tctacctcat tgtggtggag 180 

catagtcttc atgaacccat gttcttcttt ctctccatgc tggccatgac tgacctcatc 240 

ttgtccacag ctggtgtgcc taaagcactc agtatctttt ggctaggggc tcgcgaaatc 300 

acattcccag gatgccttac acaaatgttc ttccttcact ataactttgt cctggattca 360 

gccattctga tggccatggc atttgatcac tatgtagcta tctgttctcc cttgagatat 420 

accaccatct tgactcccaa gaccatcatc aagagtgcta tgggcatctc ctttcgaagc 480 

ttctgcatca tcctgccaga tgtattcttg ctgacatgcc tgcctttctg caggacacgc 540 

atcatacccc acacatactg tgagcatata ggtgttgccc agctcgcctg tgctgatatc 600 

tccatcaact tctggtatgg cttttgtgtt cccatcatga cggtcatctc agatgtgatt 660 

ctcattgctg tttcctacgc acacatcctc tgtgctgtct ttggccttcc ctcccaagat 720 

gcctgccaga aagccctcgg cacttgtggt tctcatgtct gtgtcatcct catgttttat 780 

acacctgcct ttttctccat cctcgcccat cgctttggac acaatgtctc tcgcaccttc 84 0 

cacatcatgt ttgccaatct ctacattgtt atcccacctg cactcaaccc catggtttac 900 

ggagtgaaga ccaagcagat cagagataag gttatacttt tgttttctaa gggtacagga 960 

<210> 97 

<211> 963 

<212> DNA 

<213> Homo sapiens 

<400> 97 

atgctaacac tgaataaaac agacctaata ccagcttcat ttattctgaa tggagtccca 60 

ggactggaag acacacaact ctggatttcc ttcccattct gctctatgta tgttgtggct 120 

atggtaggga attgtggact cctctacctc attcactatg aggatgccct gcacaaaccc 180 

atgtactact tcttggccat gctttccttt actgaccttg ttatgtgctc tagtacaatc 240 

cctaaagccc tctgcatctt ctggtttcat ctcaaggaca ttggatttga tgaatgcctt 300 

gtccagatgt tcttcatcca caccttcaca gggatggagt ctggggtgct tatgcttatg 360 

gccctggatc gctatgtggc catctgctac cccttacgct attcaactat cctcaccaat 420 

cctgtaattg caaaggttgg gactgccacc ttcctgagag gggtattact cattattccc 4 80 

tttactttcc tcaccaagcg cctgccctac tgcagaggca atatacttcc ccatacctac 540 

tgtgaccaca tgtctgtagc caaattgtcc tgtggtaatg tcaaggtcaa tgccatctat 600 

ggtctgatgg ttgccctcct gattgggggc tttgacatac tgtgtatcac catctcctat 660 

accatgattc tccgggcagt ggtcagcctc tcctcagcag atgctcggca gaaggccttt 720 
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aatacctgca ctgcccacat ttgtgccatt gttttctcct atactccagc tttcttctcc 780 

ttcttttccc accgctttgg ggaacacata atcccccctt cttgccacat cattgtagcc 840 

aatatttatc tgctcctacc acccactatg aaccctattg tctatggggt gaaaaccaaa 9 00 

cagatacgag actgtgtcat aaggatcctt tcaggttcta aggataccaa atcctacagc 960 

atg 963 

<210> 98 

<211> 937 

<212> DNA 

<213> Homo sapiens 

<400> 98 

tcagtggcca aaggcaatca ttcaacagtg tatgaattta tcctcttggg gctcacagat 60 

aatgcagagc ttcaagtcac tctctttggt atattccttg tagtatactt agctagcttt 120 

atgggtaatt tcggtttgat tatgctaatt caaatcagtc ctcagcttca tacacccatg 180 

tattttttcc tcagccatct ggcttttgtt gatttttctt ttacttcatc tgttgcccca 24 0 

aataccttgg taaattttct gtgtgaagtt aaaagtataa cattttatgc atgtgccatt 300 

caggtatgct gcttcatcac atttgtagtt tgtgaattat atttgctctc aatcatggca 360 

tatgatcggt atgttgccat ctgtaaccct ttactttatg tcattctcat tcctagaaaa 420 

ctgtattaaa ctgattgcta gcacgtatgt gtatggattc actgtgggac ttgtacagac 480 

agtggcgaca tcctacttgt ctttttgtga ttccaacgtg atcaaccact tctaccatga 540 

tgatgttcca ttagtggctc tggcctgttc tgacactcat gtcaaagagc tgatgttgtt 600 

aatcattgct gggttcaata ctctctgctc tctagtaatt gtgctgattt cttatggttt 660 

cattttcttt gccatcctga ggatacattc tgctgaaggg agacagaaag cattttctac 720 

cagtgcttcc catctgacct ccatcacaat attttatgga acaatcattt ttatgtaccc 780 

gcagcccaag tcaagccatt ccctgaatat ggataaagtt gcttctgtgt ttaatgtggt 84 0 

agtgattcct acattaaacc cactgatcta tagtttaaga aatcaggagg taaaaaatgc 900 

actaaagaga attatagaaa agttatgttt ggctgtc 937 

<210> 99 

<211> 968 

<212> DNA 

<213> Homo sapiens 

<400> 99 

atgacagatt acaatgaacc aatggaacca atggaagata agaaccagac agtagtgact 60 

gaatttctct tattgggcct cacagatcat ccctatcaga agattgttct cttcttcatg 120 

tttctctttg tttatcttat caccctggga ggtaacttgg ggatgatcac tctcatatgg 180 

attgatccca gactccacac tcctatgtac ttttttctta ggcacttgtc ctttgtagat 240 

atttgttcct cttcttctgt tgtgcctaag atgctgtgta atatctttgc agagaaaaaa 3 00 

gacatcactt ttctgggttg tgctgcacag atgtggttct ttggtctctt tgaggcagct 360 

gagtgttttc tctggctgcc atggcatatg accggtatgt ggccatctgc aagcccttgt 420 

tgtatacgct cattatgtct cagcaggtct gtatgcagct ggtggtaggg ccttatgcca 480 

tggctcttat aagcaccatg actcatacaa ttttcacttt ttgcttaccc ttttgtggtt 540 
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caaatattat caatcacttt ttctgtgata tttttccact gctttcccta gcatgtgcag 600 

acacctgggt gaataaattt gtgctgtttg tcttggctgg agctatagga gtactcagtg 660 

gtctgatcat catggtctcc tatatttgca tcctgatgac catcttgaag atccagactg 720 

ctgatgggaa gcaaaaagct ttcttcacct gtttttctca ccttgcggct gtctccatcc 780 

tgtatgggac tcttttcttg atttatgttc ggccaagttc aagttcctcc ctgggtatct 840 

ataaagtgat ttctctattt tatactgtgg taatccccat ggttaacccc cttatttaca 9 00 

gcttgaggaa taaggaggtg aaagatgcat tcagaagaaa aattgagagg aaaaaattta 960 

ttataggt 968 

<210> 100 
<211> 927 
<212> DNA 
<213> Homo sapiens 

<400> 100 

atggaagagg aaaatgcaac attgctgaca gagtttgttc tcacaggatt tttacatcaa 60 

cctgactgta aaataccgct cttcctggca ttcttggtaa tatatctcat caccatcatg 120 

gggaatcttg gtctaattgt tctcatctgg aaagaccctc accttcatat cccaatgtac 180 

ttattccttg ggagtttagc ctttgtggat gcttcgttat catccacagt gactccgaag 240 

atgctgatca acttcttagc taagagtaag atgatatctc tctctgaatg catggtacaa 3 0O 

tttttttccc ttgtaaccac tgtaaccaca gaatgttttc tcttggcaac aatggcatat 360 

gatcgctatg tagccatttg caaagcttta ctttatccag tcattatgac caatgaacta 420 

tgcattcagc tattagtctt gtcatttata ggtggccttc ttcatgcttt aatccatgaa 480 

gctttttcat tcagattaac cttctgtaat tccaacataa tacaacactt ttactgtgac 540 

attatcccat tgttaaagat ttcctgtact gattcctcta ttaactttct aatggttttt 600 

attttcgcag gttctgttca agtttttacc attggaacta ttcttatatc ttatacaatt 660 

atcctcttta caatcttaga aaagaagtct atcaaaggga tacgaaaagc tgtctccacc 720 

tgtggggctc atctcttatc tgtatcttta tactatggcc ccctcacctt caaatatctg 780 

ggctctgcat ctccgcaagc agatgaccaa gatatgatgg agtctctatt ttacactgtc 84 0 

atagttcctt tattaaatcc catgatctac agcctgagaa acaagcaagt aatagcttca 900 

ttcacaaaaa tgttcaaaag caatgtt 927 

<210> 101 
<211> 951 
<212> DNA 
<213> Homo sapiens 

<400> 101 

atgcaaccat ataccaaaaa ctggacccag gtaactgaat ttgtcatgat gggctttgct 60 

ggcatccatg aagcacacct cctcttcttc atactcttcc tcaccatgta cctgttcacc 120 

ttggtggaga atttggccat cattttagtg gtgggtttgg accaccgact acggagaccc 180 

atgtatttct tcctgacaca cttgtcctgc cttgaaatct ggtacacttc tgttacagtg 240 

cccaagatgc tggctggttt tattggggtg gatggtggca agaatatctc ttatgctggt 300 

tgcctatccc agctcttcat cttcaccttt cttggggcaa ctgagtgttt cctactggct 360 



* 
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gccatggcct atgatcgtta tgtggccatt tgtatgcctc tccactatgg ggcttttgtg 420 

tcctggggca cctgcatccg tctggcagct gcctgttggc tggtaggttt cctcacaccc 480 

atcttgccaa tctacctctt gtctcagcta acattttgtg gcccaaatgt cattgaccat 540 

ttctcctgtg atgcctcacc cttgctagcc ttgtcgtgct cagatgtcac ttggaaggag 600 

actgtggatt tcctggtgtc tctggctgtg ctactggcct cctctatggt cattgctgtg 660 

tcctatggca acatcgtctg gacactgctg cacatccgct cagctgctga gcgctggaag 720 

gccttctcta cctgtgcagc tcacctgact gtggtgagcc tcttctatgg cactcttttc 780 

tttatgtatg tccagaccaa ggtgacctcc tccatcaact tcaacaaggt ggtatctgtc 840 

ttctactctg ttgtcacgcc catgctcaat cctctcatct acagtcttag gaacaaggaa 900 

gtgaagggag ctctgggtcg agtcttttct ctcaactttt ggaagggaca g 951 

<210> 102 
<211> 960 
<212> DNA 
<213> Homo sapiens 

<400> 102 

atggaaagag gaaaccaaac agaagttgga aactttctcc tcctgggatt cgcagaggac 60 

tctgacatgc agcttctcct ccatgggctg ttcctctcca tgtacctggt taccatcatc 120 

ggaaacctgc tcatcatcct gaccatcagt tcagactccc acctccacac ccccatgtac 180 

ttcttcctct ccaacctgtc ctttgctgac atctgtttca catccacgac tgtcccaaag 24 0 

atgctggtga atatccaaac acaaagcaaa atgatcactt ttgcaggctg cctcactcag 300 

atattttttt tcattgcatt tggatgcctg gacaatttgc tcctgaccat gacggcctat 360 

gaccgcttcg tggccatctg ttaccccctg cactacacgg tcatcatgaa cccccggctc 420 

tgtggactgc tggttctggg gtcctggtgc atcagtgtca tgggttcctt gcttgagacc 480 

ttgaccattt tgaggctgtc ctcctgcaca aatatggaaa ttccgcactt tttttgtgat 540 

ccttccgaag tcctgaagct ggcctgttct gacaccttca tcaataacat cgtgatgtgt 600 

tttgtgacca ttgtcctggg tgtttttcct ctctgtggaa tcctattctc ttattctcag 660 

attttctcct ccgtcctaag agtatcatct gccagaggcc agcacaaagc ctttaccacc 720 

tgtggttccc acctctcagt ggtcagcttg ttctatggca ctggccttgg ggtctatctc 780 

agttctgcag ttacaccacc ttctaggaca agtctggcag cctcggtgat gcacaccatg 840 

gtcaccccca tgctgaaccc cttcatctac agcctgagga acaaggacat gaaggggtca 9 00 

ctggggagac tcctcctcag ggcaacgtct ctcaaagagg ggaccattgc taagctctca 960 

<210> 103 
<211> 9S4 
<212> DNA 
<213> Homo sapiens 

<400> 103 

atgaagccaa caatacaaat ggcttcagga aatctcacat gggtgacgga gttcattctt 60 

gtgggagtct cagatgatcc ggagctccag attcccctct tcctggtctt cctggtgctc 120 

tatttgctga ccgtggcagg gaacctgggc atcatcaccc tcaccagtgt tgaccctcaa 180 

cttcaaaccc ccatgtactt tttcctctga cacttggcta ttattaatct ttgcaattct 240 
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actgtcgttg 


cccctaaaat 


gctggttaac ttcctggtta 


ccaagaaaac 


catatcatac 


300 


tatggatgtg 


cagcccaact 


gggtggattc ttggttttca 


ttgtggctga 


gattttcacg 


360 


ctggctgcaa 


tggcctatga 


ccgctatgtg gctatttgga 


gccctctgct 


ctacgccgta 


420 


gtggtgtctc 


caaaggtgtg tcgtctgctg gtgtccctca 


cataccttca 


gagtcttatc 


480 


acagcactga 


ctgtctcttc 


ctgtgtgttc tctgtgtcat 


actgttcttc 


caacattatc 


540 


aaccattttt 


actgtgatga 


tgtccctttg ctagcattgt 


cctgttctga 


tacctacatt 


600 


ccagaaacag 


cagtctttat 


cttttcaggg accaacttgc 


ttttctccat 


gatcgttgtt 


660 


ctgatatcct 


acttcaacat 


tgttattacc attttgagga 


tacgttcctc 


agaaggacga 


720 


caaaaagcct 


tttccacctg 


tgcttctcac atgatagctg tggttgtgtt 


ctatgggact 


780 


ctccttttca 


tgtatttgca 


accaaggagt aatcattcat 


tagatactga 


caaaatggct 


840 


tcggtcttct 


acaccctggt 


gataccagtg ctgaaccctc 


taatctacag 






aagaacgtga 


aggatgcact 


aaagaggttc ctagataacc 


catgccgatc 


actio 


oca 


<210> 104 
<211> 936 
<212> DNA 
<213> Homo sapiens 
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<400> 104 
atggaacaac 


acaatctaac 


aacggtgaat gaattcattc 


ttacgggaat 


cacagatatc 


60 


gctgagctgc 


aggcaccatt 


atttgcattg ttcctcatga 


tctatgtgat 


ctcagtgatg 


120 


ggcaatttgg 


gcatgattgt 


cctcaccaag ttggactcca 


ggttgcaaac 


ccctatgtac 


180 


ttttttctca 


gacatctggc 


tttcatggat cttggttatt 


caacaactgt gggacccaaa 


240 


atgttagtaa 


attttgttgt 


ggataagaat ataatttctt 


attatttttg 


tgcaacacag 


300 


ctagctttct 


ttcttgtgtt 


cattggtagt gaacttttta 


ttctctcagc 


catgtcctac 


360 


gacctctatg 


tggccatctg 


taaccctctg ctatacacag 


taatcatgtc 


acgaagggta 


420 


tgtcaggtgc 


tggtagcaat 


cccttacctc tattgcacat 


tcatttctct 


tctagtcacc 


480 


ataaagattt 


ttactttatc 


cttctgtggc tacaacgtca 


ttagtcattt 


ctactgtgac 


540 


agtctccctt 


tgttaccttt 


gctttgttca aatacacatg 


aaattgaatt 


gataattctg 


600 


atctttgcag 


ctattgattt 


gatttcatct cttctgatag 


ttcttttatc 


ttacctgctc 


660 


atccttgtag 


ccattctcag 


gatgaattct gctggcagac 


aaaaggcttt 


ttctacctgt 


720 


ggagcccacc 


tgacagtggt 


catagtgttc tatgggactt 


tgcttttcat 


gtacgtgcag 


780 


cccaagtcca 


gtcattcctt 


tgacactgat aaagtggctt 


ccatatttta 


caccctggtt 


84 0 


atccccatgt 


tgaatccctt gatctatagt ttacgaaaca 


aagatgtaaa 


atatgcccta 


900 


cgaaggacat 


ggaataactt 


atgtaatatt tttgtt 






936 



